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Introduction 


Breast  cancer  is  a  common  disease  in  women  but  the  causes  are  still  largely  unknown.  There  is 
considerable  evidence  to  suggest  that  genetic  factors  play  an  important  role  in  causing  breast 
cancer.  In  recent  years,  our  understanding  of  genetic  predisposition  to  breast  cancer  has 
advanced  significantly.  Three  classes  of  predisposition  factor,  categorised  by  their  associated 
risks  of  breast  cancer,  are  currently  known.  BRCA1  and  BRCA2  are  high-penetrance  breast 
cancer  predisposition  genes  identified  by  genome-wide  linkage  analysis  and  positional  cloning. 
Mutational  screening  of  genes  functionally  related  to  BRCA1  and/or  BRCA2  has  revealed  four 
genes,  CHEK2,  ATM,  BRIP1  and  PALB2 ,  mutations  of  which  are  rare  and  confer  an 
intermediate  risk  of  breast  cancer.  Association  studies  have,  thus  far,  identified  ten  common 
variants  associated  with  low-penetrance  breast  cancer  predisposition  (1,2).  Our  group  has  had 
a  major  role  in  the  elucidation  of  breast  cancer  predisposition  identifying  BRCA2,  CHEK2 , 
BRIP1 ,  PALB2,  clarifying  the  role  of  ATM  in  breast  cancer  predisposition  and  contributing  to  the 
international  collaborative  efforts  to  identify  the  common,  low-penetrance  breast  cancer  alleles 
(3-8).  However,  despite  these  discoveries,  most  of  the  familial  risk  of  breast  cancer  remains 
unexplained. 

Our  aim  therefore  remains  the  identification  and  characterisation  of  the  genetic  factors  that 
increase  the  chance  of  breast  cancer  occurring.  We  have  collected  clinical  information  and 
samples  from  over  4000  breast  cancer  families  to  facilitate  these  aims.  Over  the  reporting 
period  we  have  been  engaged  to  three  main  areas,  outlined  in  further  detail  below  1) 
Undertaking  the  largest  genome-wide  association  study  (GWAS)  in  breast  cancer  to  date,  which 
includes  4000  familial  cases  and  6000  controls  in  the  first  phase  and  2)  Continuing  with  our 
candidate-gene  sequencing  analyses  of  DNA  repair  genes  which  previously  led  to  the 
identification  of  CHEK2,  ATM,  BRIP1  and  PALB2.  3)  Investigating  the  interactions  between 
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mutations  in  rare,  intermediate-penetrance  breast  cancer  predisposition  genes  and 
BRCA1/BRCA2  mutations.  An  important  focus  for  us  currently,  is  the  evaluation  and 
optimization  of  next-generation  sequencing  technologies  to  allow  us  to  take  the  candidate  gene 
sequencing  to  a  genome-wide  level,  which  we  hope  to  begin  within  the  next  reporting  period. 

Body 

As  part  of  the  programme  of  work  we  defined  five  tasks.  The  progress  towards  the  tasks  is 
outlined  in  detail  below.  As  genetics  is  a  fast-moving  field  and  there  have  been  considerable 
technological  advances  over  the  last  few  years,  we  have  revised  some  aspects  of  the  original 
tasks  to  ensure  that  we  deliver  our  aims  in  the  most  effective  and  efficient  manner  possible  (see 
below. 

Task  1:  Evaluate  the  contribution  of  BRCA1  and  BRCA2  exonic  deletions  and  duplications  to 
breast  cancer  susceptibility. 

This  task  is  complete  -  see  previous  report. 

Task  2.  Perform  famiiiai  case-control  analyses  of  non-synonymous  coding  single  nucleotide 
polymorphisms  (SNPs)  in  DNA  repair  genes  in  familial  breast  cancer  cases. 

This  task  is  complete  -  see  previous  report  and  paper  (9) 

Task  3.  Characterise  the  histopathology  and  immunohistochemistry  of  familial  breast  cancer. 

We  have  amended  this  task  slightly  to  reflect  the  new  insights  that  have  emerged.  We  are 
undertaking  and  have  achieved  the  following  in  this  reporting  period 
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a)  Accruing  data  on  the  ER  status  of  the  4000  cases  included  in  the  genome-wide 
association  study  so  that  we  can  analyse  the  data  by  ER  status. 

To  date  we  have  collected  this  information  on  678  cases.  We  are  also  collecting  data  on  PR  and 
HER2  and  other  pathological  characteristics  where  available.  However,  initial  GWAS  studies 
have  demonstrated  that  ER  status  is  the  strongest  phenotypic  surrogate  (10).  Therefore  we 
have  prioritized  obtaining  ER  status  on  as  many  cases  as  possible  for  the  primary  analysis  and 
plan  to  go  back  and  undertake  a  more  detailed  review,  and  ideally  construct  tissue  microarrays, 
at  a  later  date. 

b)  Collection  and  analysis  of  sporadic  breast  cancer  cases  with  triple-negative  tumors  (ER, 
PR  and  HER2  negative)  which  we  are  stratifying  by  BRCA1/2  status. 

Genetic  and  biological  data  indicate  that  triple-negative,  basal-like  tumors  are  a  distinctive  sub¬ 
phenotype  of  breast  cancers  that  may  have  different  underlying  causes  (10).  There  is  a  known 
strong  association  of  triple-negative  tumor  phenotype  and  BRCA1  mutations.  However,  the 
contribution  of  BRCA1  to  triple-negative  breast  cancer  in  the  absence  of  a  strong  family  history 
remains  unclear  and  is  a  source  of  considerable  confusion  diagnostically.  We  aim  to  address 
this  question  by  screening  substantial  (at  least  300)  individuals  with  triple-negative  tumors  for 
BRCA1  mutations.  Over  the  last  year  we  have  collected  over  150  cases  and  we  anticipate  that 
we  will  undertake  this  experiment  within  the  next  reporting  period.  The  BRGA-negative  triple¬ 
negative  cases  will  be  incorporated  into  our  ongoing  research  to  identify  new  genes. 

c)  Tumor  collection  and  pathological,  immunohistochemical  and  loss  of  heterozygosity 
analyses  to  define  the  tumor  characteristics  associated  with  the  rare,  intermediate- 
penetrance  breast  cancer  susceptibility  genes,  ATM,  BRIP1,  CHEK2,  PALB2. 
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To  date  we  have  90  cases  due  to  CHEK2  mutations,  19  with  ATM ,  19  with  BRIP1  and  17  with 
PALB2  mutations.  We  are  continuing  to  screen  these  genes  in  our  breast  cancer  families  to 
identify  additional  cases.  We  have  obtained  tumor  information  and/or  data  from  half  of  the 
cases.  We  plan  to  try  to  obtain  as  many  tumor  samples  as  possible  during  2009  and  will  then 
undertake  the  loss  of  heterozygosity  analyses  and  detailed  path  review  of  all  collected  cases  in 
2010  with  a  view  to  submitting  the  results  for  publication  towards  the  end  of  2010. 

Task  4.  Perform  genome-wide  familial  case-control  analyses  of  non-synonymous  coding  SNPs, 

This  task  is  completed  (as  originally  proposed)  and  published  -  see  previous  report  and  paper 
(9).  We  undertook  a  direct  analysis  of  the  known  (at  that  time)  15,000  non-synonymous  coding 
SNPs  (9).  Our  current  genome-wide  association  study  in  4000  cases  and  6000  controls  will 
additionally  allows  us  to  tag  any  non-synonymous  coding  SNPs  of  5%  or  greater  frequency  (see 
below)  and  will  therefore  will  allow  us  to  interrogate  the  remaining  non-synonymous  coding 
SNPs.  However,  our  understanding  of  this  area  has  changed  considerably  within  the  last  few 
years  and  we  have  thus  extended  and  revised  this  experiment.  Initially,  it  was  considered  most 
plausible  that  common  variation  underpinning  common  disease  would  most  likely  reside  in 
genes  and  that  non-synonymous  variation  would  be  the  most  likely  causative  alleles.  However, 
in  breast  cancer  (and  indeed  many  other  common  diseases)  very  few  of  the  common  low- 
penetrance  alleles  have  been  in  genes  and  indeed  some  are  in  gene  deserts  (2,  8).  Hence,  our 
ongoing  efforts  to  evaluate  common  variation  in  breast  cancer  susceptibility  are  encompassing 
the  full  genome  and  are  not  focusing  solely  on  coding  variants  (see  below). 

Task  5.  Identify  and  characterize  low-penetrance  breast  cancer  susceptibility  alleles 
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The  aim  of  this  task  remains  the  same.  However,  we  have  adapted  the  original  task  to  capitalize 
on  the  results  of  our  (and  other  researchers)  research  as  well  as  the  considerable  technological 
advances  that  have  occurred  over  the  last  few  years. 

a)  Undertake  a  second-generation  genome-wide  association  study  in  4000  cases  and  6000 
controls  to  identify  common,  low-penetrance  breast  cancer  susceptibility  alleles. 
Genome-wide  association  studies  (GWAS)  have  already  identified  novel  susceptibility  loci  for 
several  common  diseases.  In  2007,  we  completed  the  first  such  study  in  breast  cancer  in 
collaboration  with  Professor  Doug  Easton.  This  was  based  on  400  genetically  enriched  breast 
cancer  cases  and  400  controls  typed  for  over  220,000  SNPs.  These  SNPs  were  correlated  with 
-71%  of  known  common  SNPs,  at  r2>0.5.  Putative  associations  were  followed  up  in  -26,000 
cases  and  26,000  controls.  This  study  provided  clear  evidence  for  five  novel  breast  cancer 
susceptibility  loci  (8).  Further  studies  of  this  initial  scan  have  recently  led  to  the  identification  of 
two  further  variants  on  3p24  and  17q23  (11).  A  further  3  loci  have  been  identified  (Hunter  et  al, 
2007;  Stacey  et  al,  2007).  Together  these  10  loci  explain  about  6%  of  the  familial  risk  of  the 
disease,  so  that  in  total  approximately  25%  of  the  familial  risk  is  now  explained  (reviewed  in  (2)). 

Although  the  GWAS  studies  undertaken  to  date  have  been  successful  they  were  underpowered 
as  the  genome-wide  phase  was  undertaken  in  relatively  small  series.  Last  year,  on  the  basis  of 
our  previous  data  we  successfully  applied  for  a  grant  to  undertake  a  second-generation  GWAS 
scan  which  is  the  largest  undertaken  to  date  by  some  considerable  margin.  We  have  genotyped 
4000  familial  breast  cancer  cases  on  the  lllumina  670  SNP  chip  and  will  be  comparing  this  to 
genome-wide  data  from  6000  control  individuals.  The  grant  funded  the  genome-wide  genotyping 
of  cases  but  the  pre-genotyping  curation,  statistical  analyses  and  replication  will  be  undertaken 
as  part  of  our  EOH  initiatives.  During  this  reporting  period  we  spent  a  very  considerable  amount 
of  time  preparing  and  curating  the  samples  prior  to  the  genotyping.  We  sent  the  samples  in 


November  and  we  have  just  received  the  data.  We  will  be  undertaking  the  statistical  analyses 
both  for  standard  SNP  associations  and  for  copy  number  variants  (CNVs)  and  replication  of 
positive  associations  over  the  next  6-12  months.  We  will  replicate  the  top  30  SNPs  in-house  in 
3500  additional  cases  and  3500  controls  using  Taqman.  We  will  also  then  contribute  the 
genome-wide  data  to  an  international  meta-analysis  that  will  hopefully  include  data  from  all  the 
breast  cancer  GWAS  scans  performed  to  date. 


The  power  calculations  for  the  experiment  suggest  that  our  GWAS  scan  will  provide  better  than 
80%  power  to  detect  any  locus  conferring  a  per  allele  relative  risk  of  1.1,  provided  the  allele 
frequency  is  at  least  10%,  and  any  allele  with  a  frequency  of  5%  conferring  a  relative  risk  of  1.2. 
The  calculations  assume  the  expected  enrichment  of  the  stage  1  cases  by  virtue  of  their  family 
history,  and  are  therefore  assume  that  interactions  between  loci  are  approximately 
multiplicative.  The  power  is  principally  dependent  on  the  familial  relative  risk  conferred  by  the 
SNP,  and  power  is  better  than  90%  to  detect  any  locus  explaining  at  least  0.2%  of  the  familial 
risk. 


b)  Undertake  case-control  resequencing  of  candidate  DNA  repair  genes  to  identify  further 
rare,  low-penetrance  genes. 

In  the  last  year  we  have  particularly  focused  on  evaluating  the  DNA  repair  gene,  GEN1 .  GEN1 
is  a  recently  identified  Holliday  junction  resolvase  with  an  important  role  in  recombinational 
repair  (14).  As  such  it  is  potentially  important  in  double-strand  break  repair  via  the  homologous 
recombination  pathway.  The  breast  cancer  predisposition  genes  BRCA1,  BRCA2 ,  ATM,  TP53, 
CHEK2,  BRIP1  and  PALB2  are  all  involved  in  these  pathways  (reviewed  in  Turnbull  and 
Rahman,  2008).  In  addition,  somatic  truncating  GEN1  mutations  in  two  breast  cancer  cell  lines 
have  been  reported  (15).  Taken  together,  this  recommended  GEN1  as  a  plausible  candidate 


breast  cancer  predisposition  gene.  To  investigate  this  we  screened  the  full  coding  sequence 
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and  intron-exon  boundaries  of  GEN1  in  constitutional  DNA  from  192  familial  breast  cancer 


cases  (that  are  negative  for  mutations  in  BRCA1  and  BRCA2)  and  192  controls. 

We  identified  a  five  base  pair  deletion,  c.2515_2519delAAGTT.  We  extended  analysis  of  this 
truncation  to  540  cases  and  controls.  The  mutation  was  present  in  47/1072  case  chromosomes 
and  47/1050  control  chromosomes  successfully  analysed.  Hence  there  was  no  evidence  that 
this  variant  conferred  an  excess  risk  of  breast  cancer  (P=0.9).  The  mutation  is  predicted  to 
result  in  a  protein  product  lacking  the  terminal  69  amino  acids  of  GEN1  and  we  confirmed  this  in 
cDNA  analyses.  Furthermore,  we  identified  case  and  control  individuals  homozygous  for  the 
truncating  mutation,  indicating  that  the  last  69  amino  acids  of  GEN1  are  dispensable  for  its 
function. 

We  also  identified  7  synonymous  and  12  non-synonymous  variants.  None  were  predicted  to  be 
pathogenic  by  in  silico  analyses.  We  detected  14  of  these  variants  at  similar  frequencies  in 
cases  and  controls  with  four  being  common  in  both  groups.  Three  rare  non-synonymous 
variants  were  found  in  cases  but  not  controls  and  two  rare  non-synonymous  variants  were  found 
in  controls  but  not  cases. 

These  data  indicate  that,  despite  the  crucial  role  of  GEN1  in  recombinational  repair  and  the 
identification  of  somatic  GEN1  mutations  in  breast  cancer  cell  lines,  GEN1  is  not  a  breast 
cancer  predisposition  gene.  It  also  highlights  the  importance  of  undertaking  suitable  genetic 
experiments,  including  full  analysis  of  a  candidate  gene  in  controls,  when  evaluating  the 
potential  role  of  a  gene  in  conferring  susceptibility  to  disease.  We  are  preparing  these  data  for 
publication. 
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c)  Evaluate  novel  breast  cancer  susceptibility  alleles  in  BRCA1  /  BRCA2  families  to 
determine  whether  they  modify  or  interact  with  these  genes  in  breast  cancer. 

The  issue  of  epistatis  between  breast  cancer  susceptibility  alleles  is  a  topical  and  important 
area,  particularly  as  the  recently  identified  variants  individually  confer  very  small  increases  in 
risk.  Preliminary  analyses  from  genome-wide  association  studies  suggest  that  the  common,  low- 
penetrance  breast  cancer  susceptibility  alleles  act  multiplicatively  with  each  other  (8).  However, 
association  studies  performed  within  BRCA1  and  BRCA2  mutation-positive  families  suggest  that 
at  least  some  of  these  SNPs  confer  additional  risk  in  the  presence  of  BRCA2  but  not  BRCA1 
mutations,  although  this  may  primarily  reflect  the  association  of  the  SNPs  with  ER  positive 
tumors  (as  BRCA1  is  typically  associated  with  ER  negative  tumors)  (16,10). 

We  have  been  evaluating  the  interaction  of  the  rare,  intermediate-penetrance  breast  cancer 
genes  with  BRCA1  and  BRCA2.  Such  studies  are  challenging  because  of  the  rarity  of  mutations 
and  the  requirement  to  screen  substantial  numbers  of  cases  through  the  complete  coding 
sequence  to  identify  them.  Within  the  last  year  we  have  completed  our  analyses  of  ATM  and 
CHEK2  in  BRCA1  and  BRCA2  mutation-positive  individuals. 

We  identified  CHEK2  mutations  in  90/4741  familial  BRCA  1/2-n eg ative  breast  cancer  families 
compared  to  8/2045  controls  (P=2. 9x10-7)  and  1/345  BRCA  1-positive  families  (P=0.01)  and 
1/300  BRC/42-positive  families  P=0.02).  Similarly  we  identified  ATM  mutations  in  19/1436 
familial  BRCA1/2  negative  breast  cancer  families  compared  to  0/273  BRCA  '/-positive  families 
(P=0.01)  and  1/335  BRC/42-positive  families  (P=0.02). 

These  data  confirm  and  extend  our  initial  data  identifying  ATM  and  CHEK2  as  breast  cancer 
predisposition  genes  and  demonstrates  a  negative  interaction  with  BRCA1  and  BRCA2.  i.e. 
mutations  in  ATM  and  CHEK2  do  not  confer  an  increased  risk  of  breast  cancer  on  the 
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background  of  BRCA1  or  BRCA2  mutations.  This  is  plausibly  because  they  act  in  similar 
biological  pathways  and  thus  if  the  pathway  is  already  subverted  by  BRCA1  or  BRCA2 
mutations  there  is  no  additional  advantage  conferred  by  mutations  in  additional  DNA  repair 
genes.  These  results  are  different  to  the  common,  low-penetrance  breast  cancer  SNP  which  do 
confer  an  increased  risk  in  BRCA2  carriers  (16). 
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Key  Research  Accomplishments 

In  this  reporting  period  we  have  achieved  the  following: 

•  We  published  a  review  entitled  Breast  Cancer  Predisposition  -  Past,  Present  and  Future  in 
Annual  Review  of  Genomics  and  Human  Genetics,  the  leading  review  journal  in  this  area. 
We  were  notified  in  Dec  2008  that  our  article  was  in  the  top  5  to  be  accessed  from  their 
website  and  that  the  abstract  had  been  accessed  1127  times  and  the  PDF  has  been 
downloaded  647  times. 

•  We  have  performed  a  Genome-wide  association  scan  in  4000  familial  breast  cancer  cases 
and  6000  controls. 


Reportable  Outcomes 

1.  Turnbull  C  and  Rahman  N  (2008)  Genetic  predisposition  to  Breast  cancer:  past,  present 
and  future.  Annual  Review  of  Genetics  and  Genomics  9:321-45 

2.  Rahman  N,  invited  speaker  at  American  Society  of  Human  Genetics  session  on  Breast 
Cancer  Genetics,  Philadelphia,  2008. 

3.  Based  in  part  on  the  work  supported  by  this  award  we  successfully  applied  to  the 
Wellcome  Trust  to  fund  our  second  generation  genome-wide  association  study  - 
£655,500,  which  is  currently  underway. 

4.  Dr  Muna  Ahmed,  a  clinical  research  fellow  supported  on  this  programme  was  awarded 
her  research  doctoral  degree  based  on  the  research  she  undertook  as  part  of  this 
award.  She  subsequently  successfully  applied  for  a  Specialist  clinical  fellowship  in 
Cancer  Genetics. 
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Conclusion 


We  have  had  another  productive  year.  A  considerable  part  of  this  reporting  period  was  spent 
putting  together  the  4000  familial  samples  for  our  GWAS  scan  and  we  complete  and  replicate 
the  findings  from  this  in  the  next  reporting  period.  We  have  continued  our  sequencing 
endeavors  to  clarify  the  role  of  GEN1  in  breast  cancer  susceptibility  and  to  investigate  the 
interactions  of  rare,  intermediate-penetrance  breast  cancer  susceptibility  alleles  with  BRCA1 
and  BRCA2  mutations.  This  has  revealed  interesting  differences  to  the  common,  low- 
penetrance  alleles.  In  the  next  year  we  aim  to  begin  exome-wide  sequencing  analyses  using 
next  generation  sequencing  technologies  to  continue  to  investigate  the  genetic  basis  of  breast 
cancer. 
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Abstract 

In  recent  years,  our  understanding  of  genetic  predisposition  to  breast 
cancer  has  advanced  significantly.  Three  classes  of  predisposition  fac¬ 
tors,  categorized  by  their  associated  risks  of  breast  cancer,  are  currently 
known.  BRCA1  and  BRCA2  are  high-penetrance  breast  cancer  predis¬ 
position  genes  identified  by  genome-wide  linkage  analysis  and  posi¬ 
tional  cloning.  Mutational  screening  of  genes  functionally  related  to 
BRCA1  and/or  BRCA2  has  revealed  four  genes,  CHEK2,  ATM,  BRIP1, 
and  PALB2-,  mutations  in  these  genes  are  rare  and  confer  an  intermedi¬ 
ate  risk  of  breast  cancer.  Association  studies  have  further  identified  eight 
common  variants  associated  with  low-penetrance  breast  cancer  predis¬ 
position.  Despite  these  discoveries,  most  of  the  familial  risk  of  breast 
cancer  remains  unexplained.  In  this  review,  we  describe  the  known  ge¬ 
netic  predisposition  factors,  expound  on  the  methods  by  which  they 
were  identified,  and  consider  how  further  technological  and  intellec¬ 
tual  advances  may  assist  in  identifying  the  remaining  genetic  factors 
underlying  breast  cancer  susceptibility. 
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Linkage  analysis:  a 
statistical  method  for 
evaluating  linkage 
between  disease  and 
markers  of  known 
location  by  following 
their  inheritance  in 
families 

Penetrance:  the 
probability  that  a 
particular 

phenotype/disease  is 
expressed  in  an 
individual  with  a 
particular  genotype 


522 


INTRODUCTION 

Through  the  dynamic  interplay  of  multiple 
approaches,  the  past  two  decades  have  wit¬ 
nessed  the  gradual  emergence  of  a  clearer  un¬ 
derstanding  of  genetic  susceptibility  to  breast 
cancer.  Clinical  observation  underpinned  these 
advances  through  recognition  of  unusual  famil¬ 
ial  clustering  and  phenotypes  associated  with 
breast  cancer.  Observational  epidemiology  has 
also  provided  essential  foundations  to  our  un¬ 
derstanding,  both  in  quantifying  the  contri¬ 
bution  of  genetic  factors  to  breast  cancer  and 
in  predicting  how  these  factors  will  act  indi¬ 
vidually  and  interact.  Genetic  modeling  has 
been  performed  to  predict  the  profiles  of  the 
genes  involved,  which  in  turn  facilitates  the 
selection  of  molecular  methods  appropriate 
for  identifying  them.  Linkage  analysis,  muta¬ 
tional  screening  of  candidate  genes,  and  as¬ 
sociation  studies  have  been  used  to  identify 
predisposition  factors  of  three  distinct  risk- 
prevalence  profiles:  rare  high-penetrance  alle¬ 
les,  rare  intermediate-penetrance  alleles,  and 
common  low-penetrance  alleles. 


OBSERVATIONAL 
EPIDEMIOLOGY  AND 
SEGREGATION  ANALYSIS 

Epidemiological  observation  of  the  clustering 
of  breast  cancer  within  families  was  made  in 
Roman  times  and  reiterated  by  other  early 
socio-scientific  commentators  (22,  77).  More 
than  50  studies  have  explored  this  familial  ag¬ 
gregation  of  breast  cancer  using  predominantly 
case-control  and  cohort  designs.  Meta-analyses 
of  these  data  conclude  that,  overall,  breast  can¬ 
cer  is  twice  as  common  in  women  with  an 
affected  first-degree  relative.  Simulation  stud¬ 
ies  suggest  that  an  environmental  factor  would 
have  to  confer  at  least  a  tenfold  increase  in  risk 
for  shared  exposure  to  result  in  even  a  mod¬ 
est  increase  to  the  familial  risk  (65).  Because 
no  environmental  risk  factors  of  this  magnitude 
have  been  identified  for  breast  cancer,  it  seems 
unlikely  that  shared  environment  accounts  for 
much  of  the  familial  aggregation.  Heritability 


studies  can  assist  in  clarifying  the  relative  con¬ 
tribution  of  genetic  factors  and  shared  envi¬ 
ronment.  Via  demonstration  that  the  risk  to  a 
monozygotic  twin  is  substantially  higher  than 
to  a  dizygotic  twin  of  an  affected  individual, 
heritability  studies  using  twins  confirmed  that 
the  predominant  component  of  the  familial  ag¬ 
gregation  in  breast  cancer  is  genetic  (80,  93). 
Patterns  of  disease  aggregation  in  twins  provide 
evidence  not  just  for  a  genetic  basis  for  breast 
cancer  but  demonstrate  a  markedly  skewed  dis¬ 
tribution  of  genetic  liability,  suggesting  that  the 
majority  of  genetic  risk  may  lie  within  a  genet¬ 
ically  predisposed  minority  (93). 

Segregation  analysis  (genetic  modeling)  in¬ 
volves  the  simulation  of  various  scenarios 
whereby  disease  occurs  owing  to  combinations 
of  known  or  hypothetical  genes  of  specified 
risk,  prevalence,  and  mode  of  inheritance.  The 
patterns  of  cancers  predicted  by  the  different 
models  are  compared  with  observed  disease 
frequencies  and  the  best-fitting  model  is  fa¬ 
vored.  Segregation  analyses  have  strongly  in¬ 
fluenced  the  molecular  approaches  with  which 
genetic  predisposition  factors  have  been  in¬ 
vestigated.  The  Cancer  and  Steroid  Hormone 
Study  data  and  other  early  segregation  anal¬ 
yses  of  families  with  breast  cancer  favored  a 
highly  penetrant  autosomal  dominant  genetic 
model  (16, 29, 68, 12 1).  This  prediction  was  val¬ 
idated  through  linkage  analysis  and  the  identifi¬ 
cation  of  the  highly  penetrant  autosomal  domi¬ 
nant  breast  cancer  predisposition  genes,  BRCA1 
and  BRCA2.  Subsequent  complex  segregation 
analyses  have  incorporated  the  effects  of 
BRCA1  and  BRCA2  and  explored  the  evidence 
for  additional  genes  (8,  9,  35,  89).  Proba¬ 
bly  the  most  comprehensive  of  these  analy¬ 
ses,  based  upon  an  extensive  series  comprising 
both  population-based  cases  of  breast  cancer 
and  large  familial  clusters,  has  strongly  favored 
the  polygenic  model.  The  polygenic  compo¬ 
nent  of  risk  in  this  model  is  equivalent  to  mul¬ 
tiple  genetic  factors  acting  independently:  This 
component  is  log-normally  distributed,  has  a 
variance  that  declines  linearly  with  age,  and  ap¬ 
plies  similarly  to  BRCA  mutation  carriers  and 
noncarriers  (8-10).  The  polygenic  model  is  also 
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consistent  with  multiple  observations  that  the 
excess  of  familial  breast  cancer  is  distributed 
across  many  families,  each  typically  comprising 
a  modest  number  of  cases,  rather  than  just  a  few 
very  extensive  families  (9,  35).  The  polygenic 
model  has  largely  been  accepted  as  an  appo¬ 
site  explanation  for  the  residual  breast  cancer 
predisposition  and  has  been  validated  in  part 
by  identification  of  a  number  of  the  incumbent 
polygenes. 

STRATEGIES  FOR 
IDENTIFYING  BREAST  CANCER 
PREDISPOSITION  FACTORS 

Three  principal  experimental  designs  have  been 
used  in  the  molecular  identification  of  genetic 
breast  cancer  predisposition  factors:  genome¬ 
wide  linkage  analysis,  mutational  screening  of 
candidate  genes,  and  association  studies. 

Linkage  Analysis 
and  Positional  Cloning 

Linkage  studies  are  used  to  map  a  disease  lo¬ 
cus  via  analysis  for  cosegregation  of  genomic 
markers  with  a  specified  disease  phenotype  us¬ 
ing  samples  from  multiple  members  of  large 
families.  The  region  of  the  genome  surround¬ 
ing  the  linked  markers  is  then  interrogated 
for  likely  causative  genes  (positional  cloning). 
Linkage  analysis  is  suitable  for  mapping  only 
high-penetrance  breast  cancer  predisposition 
genes,  whereby  mutations  result  in  prominent 
cancer  families  in  which  the  majority  of  af¬ 
fected  individuals  carry  the  mutation.  If  a  gene 
is  of  lower  penetrance,  the  correlation  between 
breast  cancer  and  mutation  status  in  mutation¬ 
positive  families  may  be  insufficient  to  generate 
a  linkage  signal.  Nor  will  a  significant  signal  be 
detectable  if  the  breast  cancer  is  linked  to  a  par¬ 
ticular  locus  in  only  a  small  proportion  of  the 
families  analyzed. 

Linkage  maps  and  sets  of  markers  that  ad¬ 
equately  cover  the  genome  became  available 
in  the  late  1980s.  Using  large  breast  cancer 
pedigrees,  genome-wide  linkage  analyses  were 
undertaken  to  map  the  high-penetrance  breast 


cancer  susceptibility  genes  proposed  by  the  seg¬ 
regation  analyses,  BRCA1  andBRCA2(6\,  123). 
Linkage  analysis  has  also  been  performed  in 
groups  of  phenotypically  distinct  families  to 
map  the  loci  underlying  specific  syndromes  that 
include  an  elevated  risk  of  breast  cancer.  Po¬ 
sitional  cloning  was  then  used  to  identify  the 
causative  genes,  such  as  PTEN,  STK11,  and 
CDH1  (58,63,64,  70,87,88). 

Resequencing  Studies:  Mutational 
Screening  of  Candidate  Genes 

During  the  1990s,  emerging  understanding  of 
the  molecular  pathogenesis  of  breast  cancer 
offered  insights  into  possible  candidate  genes 
that  predispose  to  breast  cancer.  It  seemed  bi¬ 
ologically  plausible  that  proteins  interacting 
with  BRCA1  and  BRCA2  or  acting  in  similar 
DNA  repair  pathways  may  also  be  involved  in 
breast  cancer  susceptibility.  A  few  deleterious 
mutations  were  found  at  frequencies  amenable 
to  evaluation  by  association  studies.  However, 
in  the  majority  of  DNA  repair  genes  studied, 
disease-causing  mutations  (primarily  leading 
to  premature  protein  truncation  or  nonsense- 
mediated  RNA  decay)  have  been  individually 
very  rare.  A  robust  demonstration  that  such 
genes  confer  predisposition  to  breast  cancer  has 
therefore  required  mutational  screening  of  the 
entire  coding  sequence  of  the  gene  through 
large  numbers  of  cases  and  controls  to  meaning¬ 
fully  compare  the  total  number  of  pathogenic 
mutations.  Experiments  of  this  magnitude  have 
been  published  for  only  a  handful  of  DNA  re¬ 
pair  genes.  Other  postulated  candidate  genes, 
such  as  those  implicated  in  cell  cycle  regulation, 
checkpoint  control,  apoptosis,  and  steroid  hor¬ 
mone  metabolism,  have  rarely  been  evaluated 
to  this  level. 

Association  Studies 

In  an  association  study,  the  frequency  of  a  spec¬ 
ified  variant  is  compared  between  breast  can¬ 
cer  cases  and  controls.  Statistically  significant 
differences  in  allele  frequency  between  cases 
and  controls  are  more  readily  demonstrable  for 
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Single  nucleotide 
polymorphism 
(SNP):  a  single-base 
sequence  variation  that 
commonly  occurs  at  a 
particular  position 
within  the  genome 

Linkage 

disequilibrium:  the 
non-random 
association  of  alleles  at 
two  or  more  loci, 
which  suggests  that 
they  may  be  physically 
close  and  thus  linked 


3H 


variants  of  population  frequency  >5%;  hence 
association  studies  are  most  suitable  for  iden¬ 
tification  of  common  breast  cancer  predisposi¬ 
tion  variants.  Initial  studies  focused  on  genes 
proposed  by  function  and  examined  the  associ¬ 
ation  with  breast  cancer  of  numerous  common 
variants  from  recognized  and  candidate  predis¬ 
position  genes.  However,  experiments  were  of¬ 
ten  too  small,  subject  to  bias,  and  utilized  too 
lenient  levels  of  significance,  resulting  in  incon¬ 
sistency  and  lack  of  replication  of  findings.  The 
majority  of  putative  associations  likely  repre¬ 
sented  false  positives  (type  1  errors)  (40,  95).  To 
increase  power  and  rationalize  choices  of  can¬ 
didates,  collaborations  have  formed  that  under¬ 
take  association  studies  across  tens  of  thousands 
of  samples  (19). 

The  emergence  of  comprehensive  high- 
density  maps  of  single  nucleotide  polymor¬ 
phisms  (SNPs)  and  affordable  genotyping 
platforms  has  allowed  the  graduation  of  asso¬ 
ciation  studies  from  the  limitations  of  precon¬ 
ceived  notions  of  candidacy  to  the  agnosticism 
of  the  genome-wide  approach.  On  account  of 
linkage  disequilibrium,  a  panel  of  a  few  hundred 
thousand  reporter  SNPs  can  be  used  as  tags  for 
the  majority  of  the  millions  of  common  vari¬ 
ants  in  the  genome.  Accordingly,  owing  to  the 
degree  of  multiple  testing,  a  genome-wide  scan 
must  be  sufficiently  well-powered  to  ensure  that 
the  true  associations  are  detected.  The  staged 
experimental  design  is  a  means  of  optimizing 
the  statistical  power  afforded  by  a  given  sample 
size  and  has  been  useful  in  studies  to  date  be¬ 
cause  of  the  current  high  costs  of  whole  genome 
tag  SNP  panels. 


BREAST  CANCER 
PREDISPOSITION 
GENES  AND  VARIANTS 

The  breast  cancer  predisposition  factors 
identified  to  date  can  be  stratified  by 
risk  profile  into  three  tiers:  high-penetrance 
genes,  intermediate-penetrance  genes,  and 
low-penetrance  alleles.  Three  further  genes  are 
associated  with  syndromes  in  which  the  inci¬ 


dence  of  breast  cancer  is  elevated  but  the  actual 
risk  remains  unclear  (Table  1). 

High-Penetrance  Breast  Cancer 
Predisposition  Genes 

Mutations  in  three  high-penetrance  breast  can¬ 
cer  predisposition  genes  confer  a  greater  than 
tenfold  relative  risk  of  breast  cancer.  BRCA1 
and  BRCA2  were  identified  through  linkage 
analysis  and  positional  cloning.  TP53  was 
deemed  a  plausible  candidate  and  identified  as 
a  high-risk  breast  cancer  gene  through  muta¬ 
tional  screening. 

BRCA1  and  BRCA2.  The  first  convincing 
report  of  linkage  of  breast  cancer  to  17q21 
was  published  in  1990  (61).  In  1994,  positional 
cloning  revealed  the  causative  gene,  accord¬ 
ingly  named  BRCA1  (86).  Linkage  analysis  and 
positional  cloning  led  to  the  mapping  and  iden¬ 
tification  of  BRCA2  in  1994  and  1995,  respec¬ 
tively  (122,  123).  BRCA1  and  BRCA2  have  im¬ 
portant  roles  in  the  maintenance  of  genomic 
stability  by  facilitating  repair  of  DNA  double¬ 
strand  breaks.  The  cellular  roles  of  BRCA1  and 
BRCA2  were  reviewed  by  Gudmundsdottir  & 
Ashworth  (56). 

BRCA1  and  BRCA2  are  large  genes  in 
which  multiple  different  loss-of-function 
mutations  have  been  detected.  Some  founder 
mutations  are  relatively  frequent  in  par¬ 
ticular  ethnic  groups,  such  as  BRCA1_ 
185delAG,  BRCA1. 5382insC,  and  BRCA2. 
6174delT  in  the  Ashkenazim  and  BRCA2_ 
999del5  in  Icelanders.  However,  the  majority 
of  mutations  are  individually  rare  and  many 
have  been  reported  only  in  single  families. 
Most  recognized  disease-associated  mutations 
result  in  premature  protein  truncation  and  in¬ 
clude  nonsense  mutations,  deletions/insertions 
that  result  in  translational  frameshifts,  and 
mutations  that  affect  splice  sites.  More  recently 
a  number  of  exonic  deletions/duplications 
have  also  been  identified,  especially  in  BRCA1. 
In  addition,  a  large  number  of  amino  acid 
substitutions  and  synonymous  nucleotide 
substitutions  in  BRCA1  and  BRCA2  have  been 
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Table  1  Summary  of  known  breast  cancer  predisposition  factors 
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detected,  the  majority  of  which  are  innocuous. 
A  few  missense  mutations,  for  example  those 
that  target  cysteine  residues  in  the  BRCA1 
RING  domain,  abrogate  function  and  are  pre¬ 
sumed  to  confer  risks  comparable  to  truncating 
mutations  (43,  106). 

There  is  evidence  of  genotype-phenotype 
correlations  for  mutations  in  BRCA1  and 
BRCA2.  In  both  genes,  the  ratio  of  ovarian  can¬ 
cers  to  breast  cancers  conferred  by  mutations  in 
the  central  region  of  the  gene  is  higher  than  that 
for  mutations  in  the  5'  or  3'  end.  In  BRCA1  this 
central  region  is  bounded  by  nucleotides  2401 
and  4191;  in  BRCA2  the  ovarian  cancer  clus¬ 
ter  region  lies  between  nucleotides  3035  and 
6629  (114,  115).  The  biological  mechanisms 
underlying  these  observations  remain  opaque 
(120). 

BRCA1  and  BRCA2  are  high  penetrance 
breast  cancer  genes.  Estimates  of  the  risks  of 
cancer  conferred  by  mutations  in  these  genes 
vary  according  to  the  ascertainment  of  the  cases 
studied.  Early  studies  of  large  cancer  families 
suggested  that  the  risk  of  breast  cancer  by  age 
70  may  be  as  high  as  87%  [95%  confidence  in¬ 
terval  (Cl)  =  72%-95%]  for  BRCA1  and  84% 
(43%-95%)  for  BRCA2  mutation  carriers,  al¬ 
though  the  upper  estimates  were  based  on  rel¬ 
atively  small  numbers  of  families  (44,  51).  In 
population-based  studies  of  breast  cancer  cases, 
unselected  for  family  history,  the  risks  are  lower: 
65%  (5 1  %— 75%)  for  BRCA1  and  45%  (33%- 
54%)  for  BRCA2  (5).  The  pattern  of  age-related 
risk  in  BRCA2  mutation  carriers  resembles  that 
of  the  general  population  (only  higher).  By 
contrast,  the  relative  risk  of  breast  cancer  is 
markedly  elevated  in  BRCA1  mutation  carriers 
under  40  and  becomes  less  dramatic  with  ad¬ 
vancing  age  (5).  Linkage  analysis  suggests  that 
mutations  in  BRCA1  and  BRCA2  are  responsi¬ 
ble  for  disease  in  approximately  two-thirds  of 
large  families  with  site-specific  female  breast 
cancer  (>  four  cases)  but  the  attribution  dimin¬ 
ishes  sharply  for  smaller  family  clusters  (51). 
The  estimated  population  frequency  of  muta¬ 
tions  in  these  genes  is  approximately  1/1000 
per  gene  in  the  United  Kingdom.  Overall  this 


equates  to  1 5  %— 20%  of  the  excess  familial  risk 
of  breast  cancer  (4,  7,  9,  42,  91). 

BRCA1  and  BRCA2  are  also  high- 
penetrance  ovarian  cancer  genes:  Mutations 
in  BRCA1  confer  a  higher  risk  of  ovarian 
cancer  than  those  in  BRCA2,  particularly  for 
carriers  below  50  years  of  age  (5,  7,  44,  51). 
BRCA1  and  BRCA2  mutations  account  for 
most  of  the  epidemiologically  observed  familial 
coaggregation  of  breast  and  ovarian  cancer  (45, 
92).  95%  of  families  containing  four  or  more 
breast  cancers  and  an  ovarian  cancer  were 
linked  to  BRCA1  or  BRCA2,  whereas  recent 
mutational  screening  detected  mutations  in 
83  %  of  families  containing  at  least  two  cases  of 
each  cancer  (51,  99). 

The  relative  risk  of  male  breast  cancer  is  el¬ 
evated  for  both  genes,  particularly  BRCA2.  An 
elevated  risk  of  prostate  cancer  has  also  been 
demonstrated  in  BRCA2  carriers,  particularly 
in  men  aged  <65  years  (21,  116).  Small  ex¬ 
cesses  of  a  number  of  other  cancers  have  been 
observed  in  monoallelic  (heterozygous)  BRCA1 
and  BRCA2  mutation  carriers  but  larger  stud¬ 
ies  are  required  to  clarify  whether  these  find¬ 
ings  reflect  truly  elevated  risks  of  these  cancers 
(21,116).  Biallelic  mutations  in  BRCA2  result  in 
Fanconi  anemia,  subtype  Dl.  Fanconi  anemia 
is  a  rare,  recessive  chromosomal  instability  syn¬ 
drome  characterized  by  skeletal  abnormalities, 
bone  marrow  failure,  and  cancer  predisposition. 
Subtype  D 1  has  a  distinctive  phenotype  that  in¬ 
cludes  a  high  risk  of  childhood  solid  tumors 
(66).  Biallelic  BRCA1  mutations  have  never 
been  reported  convincingly  in  humans  and  are 
presumed  to  be  embryonically  lethal  (49). 

BRCA1  tumors  are  typically  high  grade,  in¬ 
vasive  ductal  carcinomas  in  which  there  is  a  high 
incidence  of  triple  negative  phenotype:  nega¬ 
tive  staining  for  ER  (estrogen  receptor),  PR 
(progesterone  receptor),  and  HER2  (ERBB2) 
(20,  73,  75).  These  tumors  also  frequently  stain 
positively  for  a  subset  of  basal  keratins  char¬ 
acteristically  expressed  in  the  normal  basal 
myoepithelium  of  the  breast.  This  basal  phe¬ 
notype  is  distinctive  and  largely  encompasses 
the  previous  histological  descriptions  of  BRCA1 


326  Turnbull  •  Rahman 


Annu.  Rev.  Genom.  Human  Genet.  2008.9:321-345.  Downloaded  from  arjoumals.annualreviews.org 
by  Dr  Clare  Turnbull  on  09/12/08.  For  personal  use  only. 


tumors  as  medullary  (69,  74).  No  distinctive 
histopathological  features  have  been  described 
in  BRCA2  tumors. 

TP53.  Li-Fraumeni  syndrome  is  a  cancer  pre¬ 
disposition  syndrome  in  which  there  is  a  high 
frequency  of  early  onset  breast  cancer  found  in 
association  with  sarcomas  and  childhood  can¬ 
cers  of  the  adrenal  cortex,  brain,  and  other  sites. 
Although  of  a  penetrance  sufficient  for  mapping 
through  linkage  analysis,  the  associated  early 
mortality  and  rarity  of  the  condition  impeded 
collection  of  sufficient  familial  samples.  p5  3  was 
recognized  early  as  a  prominent  transcription 
factor  central  to  multiple  cellular  pathways  and 
is  frequently  somatically  mutated  in  tumors. 
These  observations  recommended  TP53  as  a 
plausible  candidate  gene  for  Li-Fraumeni  syn¬ 
drome  and  in  1990  mutational  screening  of  the 
gene  revealed  causative  mutations  in  the  five 
families  studied  (82).  The  overall  lifetime  can¬ 
cer  risk  for  women  with  Li-F raumeni  syndrome 
is  grossly  elevated,  predominantly  on  account 
of  their  high  risk  of  breast  cancer  (15,  28). 
Li-Fraumeni  syndrome  is  rare  and  mutations 
in  TP53  are  uncommon  in  non-Li-Fraumeni 
breast  cancer  families  (18,  48,  76).  Thus,  the 
attributable  risk  of  TP53  mutations  to  familial 
breast  cancer  is  very  low. 

Breast  Cancer  Predisposition  Genes 
of  Uncertain  Penetrance 

Three  syndromes  are  currently  clearly  associ¬ 
ated  with  an  increased  risk  of  breast  cancer,  but 
the  magnitude  of  the  associated  risk  for  each 
remains  uncertain:  Cowden  syndrome  (caused 
by  PTEN  mutations),  Peutz-Jeghers  syndrome 
(caused  by  STK11  mutations),  and  Heredi¬ 
tary  diffuse  gastric  cancer  syndrome  (caused  by 
CDH1  mutations). 

PTEN.  Cowden  syndrome  is  a  multiple 
hamartoma  syndrome  that  includes  increased 
risk  of  benign  and  malignant  tumors  of  the 
breast,  thyroid,  and  endometrium;  distinctive 
higher-penetrance  features  include  mucocuta¬ 
neous  lesions,  macrocephaly,  and  hamartoma- 


tous  intestinal  polyps.  A  study  of  12  Cowden 
syndrome  families  revealed  linkage  to  chromo¬ 
some  lOq,  leading  to  identification  of  PTEN 
as  the  causative  gene  (87,  88).  PTEN  encodes  a 
lipid  phosphatase  that  functions  as  a  tumor  sup¬ 
pressor  through  negative  regulation  of  a  cell- 
survival  signaling  pathway.  There  is  cross  talk 
between  this  PTEN-related  pathway  and  other 
pathways,  including  those  involving  Ras,  p53, 
and  TOR  (36). 

STK1 1 .  Peutz-Jeghers  syndrome  is  character¬ 
ized  by  hamartomatous  intestinal  polyps,  mu¬ 
cocutaneous  pigmentation,  and  increased  inci¬ 
dence  of  several  malignancies,  including  breast 
cancer.  Directed  by  patterns  of  loss  of  heterozy¬ 
gosity  in  polyps  of  affected  individuals,  studies 
in  12  Peutz-Jeghers  families  established  link¬ 
age  to  chromosome  19p  and  positional  cloning 
led  to  the  identification  of  STK11  ( LKB1 )  as 
the  causative  gene.  STK1 1  is  a  serine/threonine 
kinase  that  inhibits  cellular  proliferation,  con¬ 
trols  cell  polarity,  and  interacts  with  the  TOR 
pathway  (1,  63,  64,  70). 

CDH1.  Linkage  analysis  in  a  single  New 
Zealand  Maori  family  with  multiple  cases  of 
diffuse  gastric  cancer  enabled  the  identification 
of  CDH1  ( ECAD )  as  the  responsible  gene  (58). 
Many  additional  families  have  since  been  re¬ 
ported  and  an  elevated  frequency  of  lobular 
breast  carcinoma  has  been  observed  (71,  96). 
Occasional  CDH1  mutations  in  families  with 
cases  of  lobular  breast  cancer  but  no  gastric  can¬ 
cer  have  also  been  reported  (83).  CDH1  encodes 
E-cadherin,  a  transmembrane  protein  impor¬ 
tant  in  the  maintenance  of  cell  polarity. 

These  conditions  were  identified  because 
the  non-breast-cancer-related  features  were 
sufficiently  distinctive  and  penetrant  to  reli¬ 
ably  ascertain  families  and  assign  affection  sta¬ 
tus,  thus  facilitating  mapping  of  the  causative 
genes  by  linkage  analysis.  The  true  breast  can¬ 
cer  risks  associated  with  mutations  are  unclear 
and  it  is  possible  that  the  published  risks  are  in¬ 
flated  through  the  bias  of  studying  families  with 
prominent  phenotypes.  However,  the  relative 
risks  are  likely  to  be  intermediate  and  in  the 
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range  of  2  to  10.  There  is  no  current  evidence 
that  mutations  in  PTEN,  STK11,  or  CDH1  ac¬ 
count  for  a  substantial  proportion  of  familial  or 
sporadic  breast  cancer  in  the  absence  of  their 
respective  syndromes;  because  these  syndromes 
are  very  rare,  the  attributable  risk  of  mutations 
in  these  genes  to  familial  breast  cancer  is  low 
(14,27,  57,  83,  107). 

Intermediate-Penetrance  Breast 
Cancer  Predisposition  Genes 

Four  intermediate-penetrance  breast  cancer 
genes  have  been  identified  via  mutational 
screening:  CHEK2,  ATM,  BR1P1,  and  PALB2. 
Mutations  in  these  genes  are  rare  and  confer  a 
relative  risk  of  breast  cancer  of  2  to  4.  RAD 50 
may  also  be  an  intermediate-penetrance  breast 
cancer  predisposition  gene,  but  convincing  re¬ 
sults  to  date  pertain  only  to  a  founder  mutation 
detected  in  the  Finnish  population.  The  rar¬ 
ity  of  mutations  and  modest  associated  risks  are 
such  that  the  attributable  risk  of  mutations  in 
these  genes  is  low:  Together  they  account  for 
approximately  2.3  %  of  excess  familial  risk  (98). 

CHEK2.  CHEK2  encodes  CHK2,  a  central 
mediator  of  cellular  response  to  DNA  damage 
that  phosphorylates  both  p53  and  BRCA1  to 
regulate  repair  of  DNA  double-strand  breaks. 
The  llOOdelC  mutation  in  CHEK2  was  first 
reported  after  mutational  screening  of  this  el¬ 
igible  candidate  gene  in  a  single  family  with 
features  of  Li-Fraumeni  syndrome  (13).  How¬ 
ever,  the  population  frequency  of  this  mutation 
was  shown  to  be  approximately  1%  (18/1620), 
demonstrating  that  CHEK2  could  not  be  a 
high-risk  Li-Fraumeni  syndrome  gene.  The 
identification  of  the  CHEK2A  lOOdelC  muta¬ 
tion  in  4.2%  (30/718)  of  breast  cancer  fami¬ 
lies  demonstrated,  instead,  that  CHEK2  was  an 
intermediate-penetrance  breast  cancer  predis¬ 
position  gene  (P  =  5  x  10~6)  (85).  Segregation 
analysis  in  these  families  provided  an  indirect 
estimate  of  1.70  (95%  Cl  =  1.32-2.20)  for  the 
relative  risk  of  breast  cancer  conferred  by  the 
mutation.  Combining  data  from  ten  case  con¬ 


trol  studies,  the  CHEK2  consortium  demon¬ 
strated  the  frequency  of  CHEK2A  lOOdelC  in 
population-based  breast  cancer  cases  to  be 
1.9%  (201/10,860)  compared  with  0.7%  in  con¬ 
trols  (64/9065)  (P  —  1  x  10~7).  These  fre¬ 
quencies  equate  to  a  direct  odds  ratio  of  breast 
cancer  of  2.34  (95%  Cl  =  1.72-3.20)  (26).  It 
has  been  proposed  that  CHEK2 _1  lOOdelC  may 
confer  risks  of  prostate  and  other  cancers  but 
evidence  is  conflicting  (37,  38,  85,  105,  117). 
There  are  also  reports  of  other  rare  truncating 
mutations  in  CHEK2,  which  would  be  antici¬ 
pated  to  have  similar  risks  to  CHEK2A  lOOdelC 
(17,  39,  55, 103, 119). A  number  of  rare  CHEK2 
missense  variants,  such  as  470T/C  (I157T), 
have  been  reported  in  breast  cancer  cases  but 
their  significance  has  yet  to  be  clarified  (2,  17, 
55,  72,  103). 

ATM.  The  proposal  of  ATM  as  a  breast  can¬ 
cer  predisposition  gene  first  came  in  1976  from 
an  epidemiological  study  that  reported  an  ex¬ 
cess  of  breast  cancer  in  female  relatives  of  pa¬ 
tients  with  ataxia  telangiectasia,  an  autosomal 
recessive  syndrome  characterized  by  progres¬ 
sive  cerebellar  ataxia,  immune  deficiency,  and 
cancer  predisposition.  This  astute  observation 
preceded  the  mapping  of  the  gene  by  almost 
two  decades  and  has  been  subsequently  repli¬ 
cated  in  a  number  of  large  epidemiological  stud¬ 
ies  (41,  102,  112,  113).  The  candidacy  of  ATM 
as  a  breast  cancer  predisposition  gene  was  fur¬ 
ther  enhanced  as  the  function  of  the  encoded 
protein  became  apparent:  ATM  occupies  a  cen¬ 
tral  role  in  the  response  to  double-strand  DNA 
breaks  through  initiation  of  a  signaling  cascade 
that  involves  phosphorylation  of  multiple  pro¬ 
teins  including  p53,  BRCA1,  and  CHK2.  Re¬ 
sults  from  initial  mutational  screening  of  ATM 
in  breast  cancer  were  inconclusive  and/or  in¬ 
consistent  but  convincing  proof  was  finally  pub¬ 
lished  in  2006.  Mutations  were  found  in  12/443 
familial  cases  negative  for  mutations  in  BRCA1 
and  BRCA2  and  2/521  controls  (P  =  0.0047). 
These  mutations  comprised  truncations,  splice- 
site  abnormalities,  and  two  missense  mutations 
that  were  known  to  affect  protein  function  and 
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cause  ataxia  telangiectasia.  The  relative  risk  of 
breast  cancer  conferred  by  ATM  mutations  was 
estimated  from  segregation  analysis  to  be  2.37 
(95%  Cl  =  1.51-3.78, P=  0.0003)  (101). This 
indirect  estimate  is  very  similar  to  that  derived 
from  large  epidemiological  studies  (113).  There 
is  some  epidemiological  evidence  that  muta¬ 
tions  in  ATM  may  predispose  to  other  cancers 
but  this  has  yet  to  be  confirmed  or  corroborated 
by  molecular  data  (113). 

BRIP1.  BR1P1  ( BACH1 )  encodes  a  DEAH 
helicase  that  interacts  with  BRCA1  and  has 
BRCA1 -dependent  roles  in  DNA  repair  and 
checkpoint  control  (24,  90).  In  2006,  truncat¬ 
ing  BR1P1  mutations  were  reported  in  9/1212 
index  breast  cancer  cases  from  families  nega¬ 
tive  for  mutations  in  BRCA1  and  BRCA2  com¬ 
pared  with  2/2081  controls,  thus  providing 
convincing  evidence  that  BR1P1  is  a  breast  can¬ 
cer  predisposition  gene  (P  =  0.0030).  Segre¬ 
gation  analysis  found  the  relative  risk  conferred 
by  BR1P1  mutations  to  be  2.0  (95%  Cl  =  1.2- 
3.2,  P  —  0.012)  (104).  Of  the  mutations  re¬ 
ported  in  BR1P1,  approximately  half  are  a  non¬ 
sense  mutation,  2392C/T  (R798X),  whereas 
the  remainder  include  disparate  small  insertions 
and  deletions.  Concurrendy,  it  emerged  that 
biallelic  mutations  in  BR1P1  result  in  Fanconi 
anemia,  although  the  Fanconi  anemia  subtype 
associated  with  BR1P1  (subtype  J)  is  phenotyp- 
ically  distinct  from  that  associated  with  BRCA2 
(subtype  Dl)  and  has  not  been  associated  with 
childhood  solid  tumors  (78,  79,  81). 

PALB2.  Precipitation  of  BRCA2-containing 
complexes  revealed  a  novel  protein  that  was 
shown  to  promote  the  localization  and  stability 
of  BRCA2,  thus  facilitating  BRCA2-mediated 
DNA  repair.  Further  study  of  PALB2  (partner 
and  localizer  of  BRCA2)  revealed  that  knock¬ 
down  of  the  gene  resulted  in  sensitization  of 
cells  to  chromosomal  damage  by  mitomycin  C, 
the  hallmark  of  Fanconi  anemia  (125).  Biallelic 
truncating  mutations  in  PALB2  were  detected 
in  families  with  Fanconi  anemia  not  caused  by 
mutations  in  other  genes  and  this  subtype  was 


designated  FA-N  (100,  124).  The  phenotypes 
of  FA-N  and  Fanconi  anemia  caused  by  BRCA2 
mutations  (FA-D1)  are  very  similar  and  can 
be  distinguished  from  classical  Fanconi  anemia 
on  account  of  a  markedly  elevated  frequency 
of  childhood  solid  tumors  such  as  Wilms  tu¬ 
mor  and  medulloblastoma.  With  the  recent 
precedents  of  both  BR1P1  and  BRCA2  caus¬ 
ing  Fanconi  anemia  in  biallelic  mutation  carri¬ 
ers  and  conferring  susceptibility  to  breast  can¬ 
cer  in  monoallelic  carriers,  the  identification  of 
PALB2  as  a  Fanconi  anemia  gene  further  rec¬ 
ommended  PALB2  as  an  attractive  candidate 
breast  cancer  predisposition  gene. 

Sequencing  of  the  gene  revealed  truncat¬ 
ing  mutations  in  10/923  index  breast  cancer 
cases  from  families  negative  for  mutations  in 
BRCA1  and  BRCA2  compared  with  0/1084  in 
controls  (P  =  0.0004).  Segregation  analysis 
from  these  families  estimated  the  relative  risk  of 
PALB2  mutations  to  be  2.3  (95%  Cl  =  1.4— 3.9, 
P  =  0.0025)  (98).  Mutational  screening  of 
PALB2  in  a  Finnish  series  detected  one  trun¬ 
cating  mutation:  1592delT.  This  mutation 
was  identified  in  3/113  (2.7%)  familial  cases, 
18/1918  (0.9%)  unselected  breast  cancer  cases, 
and  6/2501  (0.2%)  controls.  The  directly  de¬ 
rived  odds  ratio  of  breast  cancer  of  this  mu¬ 
tation  is  therefore  3.94  (95%  Cl  =  1.5-12.1) 
(47).  This  mutation  has  not  been  detected 
in  other  series  and  may  represent  a  Finnish 
founder.  A  French  Canadian  founder  mutation, 
2323  C/T  (Q775X),  has  also  been  reported 
(52). 

RAD50.  The  highly  conserved  MRN  com¬ 
plex  ( MRE1 1 ,  RAD  50,  and  NBS1)  interacts  with 
BRCA1  and  plays  a  central  role  in  DNA  re¬ 
pair.  A  truncating  mutation  in  RAD 50, 657 delT, 
was  demonstrated  in  8/317  consecutively  ascer¬ 
tained  breast  cancer  cases  and  6/1000  controls 
front  Finland  [P  =  0.008,  odds  ratio  (OR)  = 
4.3,  95%  Cl  =  1.5-12.5].  Other  rare  truncat¬ 
ing  mutations  have  been  reported  in  RAD50, 
but  the  contribution  of  RAD50  to  breast  can¬ 
cer  predisposition  outside  of  Finland  requires 
further  clarification  (62,  118). 
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Distinctive  features  of  intermediate- 
penetrance  breast  cancer  predisposition 
genes.  As  in  BRCA1  and  BRCA2,  multi¬ 
ple  different  truncating  mutations  occur  in 
these  intermediate-penetrance  genes,  the 
frequencies  of  which  differ  between  popula¬ 
tions  on  account  of  founder  and  migrational 
effects.  Monoallelic  mutations  in  these  genes 
confer  an  approximately  twofold  increase 
in  the  risk  of  breast  cancer.  Although  there 
is  some  imprecision  in  the  indirect  risk  es¬ 
timates  derived  from  segregation  analysis, 
population-based  series  and  epidemiological 
data  have  provided  some  corroboration  for 
these  figures.  There  is  currently  no  strong 
evidence  that  monoallelic  mutations  in  these 
genes  confer  a  phenotype  beyond  breast 
cancer  predisposition.  Biallelic  mutations  in 
ATM,  BR1P1,  and  PALB2  result  in  severe 
childhood  disorders:  ataxia  telangiectasia  and 
Fanconi  anemia  types  J  and  N,  respectively 
(97).  Because  these  genes  function  in  the  same 
pathways  as  BRCA1  and  BRCA2,  it  remains 
unclear  why  abrogation  of  their  functions 
confers  more  modest  risks  of  breast  cancer. 
However,  the  lower  penetrance  of  these  genes 
means  that  they  deviate  from  some  of  the 
hallmarks  of  pathogenicity  associated  with 
mutations  of  BRCA1  and  BRCA2.  Mutations 
in  these  genes  do  not  show  the  classical, 
near-complete  pattern  of  segregation  of  disease 
with  the  family  mutation.  By  contrast,  because 
breast  cancer  is  a  common  disease,  in  a  typical 
pedigree  an  appreciable  proportion  of  breast 
cancers  arise  by  chance  and  thus  may  manifest 
in  either  mutation  carriers  or  noncarriers. 
Likewise,  one  should  not  anticipate  finding 
the  pattern  of  loss  of  the  wild-type  allele 
detected  at  high  frequency  in  tumors  in  BRCA1 
and  BRCA2  mutation-positive  individuals. 
Firstly,  the  mechanism  of  tumorigenesis  of 
these  genes  is  unknown  and  may  not  require 
inactivation  of  the  wild-type  allele.  Secondly, 
even  if  the  classical,  two-hit  model  of  a  tumor 
suppressor  gene  does  apply,  one  could  not 
expect  to  demonstrate  loss  of  heterozygosity 
in  all  tumors  occurring  in  mutation  carriers 


because  the  mutation  is  not  causally  implicated 
in  a  significant  portion  of  them. 

Low-Penetrance  Breast  Cancer 
Predisposition  Alleles 

There  is  currendy  strong  evidence  for  the  asso¬ 
ciation  with  breast  cancer  of  eight  common  al¬ 
leles,  which  each  confer  a  relative  risk  of  breast 
cancer  of  <1.5.  A  nonsynonymous  variant  in 
CASP8  was  identified  through  a  candidate- 
based  approach  in  a  consortium  comprising  14 
studies  that  compared  17,109  cases  and  16,423 
controls  (34).  Seven  further  variants  have  been 
detected  in  three  recent  genome-wide  associ¬ 
ation  studies  (Table  2).  Easton  and  coworkers 
(46)  performed  a  three-stage  genome-wide  ex¬ 
periment.  In  the  first  stage  a  Perlegen  platform 
was  used  to  study  the  association  of  227,876 
SNPs  in  3  90  cases  with  a  family  history  of  breast 
cancer  and/or  bilateral  disease  and  364  con¬ 
trols  from  the  United  Kingdom.  In  the  second 
stage  12,711  SNPs  (the  most  significant  5%  of 
SNPs  from  stage  1)  were  genotyped  in  3990 
cases  and  3916  UK  controls  using  a  custom- 
designed  array.  In  the  third  stage  the  30  most 
significant  SNPs  were  tested  for  confirmation 
in  21,860  cases  and  22,578  controls  from  22 
international  studies  (46).  The  study  identified 
five  variants  associated  with  breast  cancer  at  a 
significance  level  of  P  <  10~7.  Stacey  and  col¬ 
leagues  (110)  conducted  a  genome-wide  scan 
in  1600  Icelandic  cases  and  11,563  controls  us¬ 
ing  the  Illumina  HumanHap300  platform;  the 
ten  best  associated  SNPs  were  studied  further 
in  five  international  replication  sets  comprising 
2954  cases  and  6014  controls.  This  study  iden¬ 
tified  one  of  the  variants  reported  by  Easton 
and  coworkers  (rs3 803662)  and  a  novel  region 
of  association  on  2q35.  As  part  of  the  National 
Cancer  Institute  Cancer  Genetic  Markers 
of  Susceptibility  (CGEMS)  Project,  Hunter 
and  colleagues  (67)  used  the  Illumina  Hu- 
manHap500  array  to  genotype  1145  post¬ 
menopausal  invasive  breast  cancer  cases  and 
1142  North  American  controls  of  European 
ancestry.  The  six  best-associated  signals  were 
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*  Abbreviations  used:  LD,  linkage  disequilibrium;  MAF,  minor  allele  frequency  in  control  population;  OR,  odds  ratio  (measured  relative  to  common  homozygotes);  Cl,  confidence  interval; 
P-trend,  P  value  for  per  allele  effect  under  multiplicative  model;  GW,  genome-wide  association  study;  Candidate,  association  studies  of  single  nucleotide  polymorphisms  (SNPs)  in  candidate  genes. 
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further  studied  in  1776  cases  and  2072  con¬ 
trols  from  three  North  American  replication 
sets.  The  results  of  this  study  confirmed  the  sig¬ 
nal  in  intron  2  of  FGFR2  reported  by  Easton 
and  coworkers  (67).  The  final  predisposition 
SNP  on  5pl2  was  identified  through  mining 
and  cross-referencing  of  associations  of  border¬ 
line  significance  from  these  three  genome-wide 
association  studies;  this  signal  was  verified  by 
Stacey  and  coworkers  (1 1 1)  in  a  replication  se¬ 
ries  of  5,028  cases  and  32,090  controls. 

10q26  (FGFR2  intron  2).  The  strongest  evi¬ 
dence  for  association  was  for  a  signal  within  in¬ 
tron  2  of  the  gene  encoding  fibroblast  growth 
factor  receptor  2  ( FGFR2 )  [per  allele  OR  = 
1.26  (1.23-1.30)  P  =  2  x  10“76]  (46,  67,  111). 
Fine  association  mapping  of  the  region  of  link¬ 
age  disequilibrium  has  been  undertaken  using 
UK  samples  and  refined  using  an  Asian  series  (in 
which  linkage  disequilibrium  is  weaker).  These 
studies  have  reduced  the  association  to  a  mini¬ 
mum  set  of  six  variants  within  intron  2,  which 
are  too  strongly  correlated  for  their  individual 
effects  to  be  discerned  using  further  genetic  epi¬ 
demiologic  approaches  (46). 

Somatic  FGFR2  mutations  have  been  re¬ 
ported  in  several  cancers  and  result  in  overac¬ 
tivity  of  the  protein.  It  therefore  seems  plausible 
that  the  elevated  breast  cancer  risk  is  somehow 
mediated  through  increased/altered  activity  of 
FGFR2,  but  the  molecular  basis  for  the  associ¬ 
ation  is  currently  unknown. 

The  risk-prevalence  profile  of  this  allele 
means  that  the  power  to  detect  it  in  the  genome¬ 
wide  scans  is  high.  The  corollary  of  this  is  that 
if  other  alleles  of  a  similar  risk-prevalence  pro¬ 
file  exist,  they  should  most  likely  have  been  de¬ 
tected.  Thus,  it  is  possible  that  this  may  be  the 
allele  of  most  substantial  effect  to  exist  within 
this  category  of  risk  alleles.  The  differences 
in  the  risks  conferred  are  nevertheless  modest: 
The  risk  of  breast  cancer  by  age  70  in  individuals 
homozygous  for  the  risk  allele  is  10.5%  com¬ 
pared  with  6.7%  for  heterozygotes  and  5.5% 
for  nonrisk  allele  homozygotes  (based  upon  UK 
breast  cancer  incidence  figures).  However,  the 
risk  allele  is  common:  14%  of  the  UK  popu¬ 


lation  and  19%  of  UK  breast  cancer  cases  are 
homozygous  for  the  risk  allele.  Hence,  this  al¬ 
lele  may  account  for  approximately  1 .9%  of  the 
excess  familial  risk  of  breast  cancer. 

16ql2.  Strong  evidence  for  association  of 
rs3803662  with  breast  cancer  was  reported  in 
two  studies  [per  allele  OR  =  1.20  (1.16-1.24), 
P  =  1  x  1 0-36  in  the  UK  study  and  per  allele 
OR  =  1.28 (1.21-1.35), P=  5.9  x  10-I9inthe 
Icelandic  study]  (46,  110).  This  variant  lies  on 
16ql2  and  tags  a  region  of  linkage  disequilib¬ 
rium  containing  the  5'  end  of  TNRC9  ( TOX3 ) 
and  a  hypothetical  gene  LOC643714.  TNRC9 
is  a  gene  of  uncertain  function  that  contains  a 
putative  high  mobility  group  box  motif,  sug¬ 
gesting  that  it  may  act  as  a  transcription  factor. 
Results  from  a  previous  study  suggested  that 
TNRC9  expression  is  predictive  of  metastasis  of 
breast  cancer  to  bone  (108). 

2q35.  The  Icelandic  study  found  evidence  for 
association  with  breast  cancer  of  a  variant  on 
2q35  [per  allele  OR  =  1.20  (1.14-1.26),  P  = 
1.3  x  10~13].  The  region  of  linkage  disequi¬ 
librium  does  not  contain  any  known  genes. 
The  nearest  genes  are  TNP1  (181  kb),  1GFBP5 
(345  kb),  and  IGFBP2  (376  kb)  upstream  and 
TNS1  (761  kb)  downstream.  Although  this  sig¬ 
nal  was  not  detected  in  the  other  genome-wide 
scans,  we  have  replicated  the  association  in  a 
UK  series  of  familial  breast  cancer  cases  [per 
allele  OR  =  1.17  (95%  Cl  1.07-1.27),  P  = 
0.0004]  (C.  Turnbull  and  N.  Rahman,  unpub¬ 
lished  data)  and  it  is  being  evaluated  by  the 
Breast  Cancer  Association  Consortium. 

5pl2.  Stacey  and  coworkers  (111)  noted  that 
one  of  the  ten  top-ranked  SNPs  in  the  Ice¬ 
landic  genome-wide  study  was  on  chromo¬ 
some  5pl2  (rs7703618).  The  CGEMS  study 
had  reported  an  association  of  borderline  sig¬ 
nificance  on  5pl2  at  rs4866929,  a  SNP  in  tight 
linkage  disequilibrium  with  rs7703618,  while 
Easton  and  coworkers  (46)  had  reported  a  ten¬ 
tative  signal  at  5p45  (rs981782),  371  kb  away, 
which  was  also  in  loose  linkage  disequilibrium 
with  rs7703618  (r2  =  0.10).  Struck  by  the 
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coincidence  of  tentative  signals,  they  examined 
21  SNPs  in  this  region  for  association  with 
breast  cancer  and  found  rsl0941679  to  be  the 
most  significantly  associated  SNP  [per  allele 
OR  1.19  (1.13-1.26),  P  =  2.9  x  10"11].  The 
only  gene  in  the  region  of  linkage  disequilib¬ 
rium  is  MRPS30  ( PDCD9 ,  programmed  cell 
death  protein  9),  which  encodes  a  component 
of  the  small  subunit  of  the  mitochondrial  ribo¬ 
some  and  has  been  implicated  in  apoptosis. 

5qll.  Association  has  been  found  for  rs8893 1 2 
[per  allele  OR  =  1.13  (1.10-1.16),  P  =  7  x 
10-20].  This  SNP  tags  a  280-kb  block  of  linkage 
disequilibrium  on  5ql  1  that  contains  the  genes 
MAP3K1  ( MEKK ),  MGC33648,  and  MIER3. 
MAP3K1  is  the  most  plausible  candidate  gene 
therein  and  encodes  mitogen-activated  protein 
kinase  kinase  kinase  1 ,  which  is  involved  in  cell 
signaling  (46). 

2q33  (CASP8  D302H).  This  association  was 
ascertained  through  the  biological  candidacy  of 
CASP8,  which  encodes  caspase-8,  a  protein  in¬ 
volved  in  apoptosis  [per  allele  OR  =  0.88  (0.84- 
0.92),  P  =  1.1  x  10~7]  (34).  However,  the  re¬ 
gion  of  linkage  disequilibrium  tagged  by  this 
SNP  is  approximately  290  kb  in  length  and  in¬ 
cludes  a  number  of  other  genes:  TRAK2  (en¬ 
coding  trafficking  protein  kinesin  binding  2), 
three  genes  identified  as  candidates  for  juve¬ 
nile  amyotrophic  lateral  sclerosis  2  ( ALS2CR1 2, 
ALS2CR2,  and  ALS2CR1 1),  and  two  hypothet¬ 
ical  genes  ( LOC389286  and  LOC729191).  It  is 
currently  unclear  whether  D302H  is  the  causal 
variant  or  whether  it  tags  a  distinct  causal  vari¬ 
ant,  which  may  or  may  not  mediate  its  effect 
through  CASP8.  The  relatively  low  minor  al¬ 
lele  frequency  and  risk  of  this  allele  mean  that 
the  power  to  detect  its  signal  at  the  genome¬ 
wide  level  is  comparatively  low.  This  is  con¬ 
sistent  with  failure  of  the  allele  to  reach  the 
arbitrary  significance  thresholds  required  for 
further  investigation  in  the  genome-wide  asso¬ 
ciation  scans  performed  to  date.  It  is  also  con¬ 
sistent  with  the  presumed  existence  of  many 
further  alleles  of  comparable  risk-prevalence 
profiles. 


8q24.  Association  was  found  for  a  SNP  within 
an  110-kb  block  of  linkage  disequilibrium  on 
8q24,  which  contains  no  known  genes  [per  al¬ 
lele  OR  =  1.08  (1.05-1.11),  P  =  5  x  10~12] 
(46).  It  is  interesting  that  in  the  first  wave  of 
genome-wide  association  studies  in  common 
cancers,  8q24  has  also  yielded  multiple  inde¬ 
pendent  prostate  cancer  loci,  one  of  which  is 
also  a  colon  cancer  risk  allele  (rs6983267)  (59, 
60).  The  tag  SNP  associated  with  breast  can¬ 
cer  does  not  demonstrate  association  with  colon 
cancer  or  prostate  cancer  and  is  60  kb  proximal 
to  rs6983267.  Clustering  of  these  predisposi¬ 
tion  alleles  may  be  a  coincidence  or  may  indi¬ 
cate  a  common  or  related  mechanism  of  cancer 
predisposition.  The  nearest  gene  to  the  breast 
cancer  predisposition  locus  is  MYC  and  it  is 
plausible  that  the  susceptibility  occurs  through 
some  unknown  mechanism  of  activation  of  this 
oncogene. 

Ilpl5.  Another  associated  tag  SNP  lies  in 
intron  10  of  LSP1  ( WP43 ),  which  encodes 
lymphocyte-specific  protein  1  [per  allele  OR  = 
1.07  (1.04-1.11),  P  =  3  x  10  9],  an  F-actin 
bundling  cytoskeleton  protein  that  is  expressed 
in  hematopoetic  and  endothelial  cells  (46). 
Other  genes  within  this  region  of  linkage  dis¬ 
equilibrium  include  TNNT3,  troponin  T  type 
3;  MRPL23,  mitochondrial  ribosomal  protein; 
HI  9,  an  imprinted,  maternally  expressed,  un¬ 
translated  mRNA;  and  LOC728008,  a  hypo¬ 
thetical  gene. 

Distinctive  features  of  low-penetrance 
breast  cancer  predisposition  alleles.  A  fasci¬ 
nating  aspect  of  the  recent  genome-wide  scans 
has  been  the  opaqueness  of  the  relationship 
of  the  identified  variants  with  known  protein¬ 
coding  genes.  The  patterns  of  linkage  disequi¬ 
librium  suggest  that  the  causal  variants  need 
not  lie  in  the  coding  region  of  a  gene,  as  evi¬ 
denced  by  the  predisposition  variants  at  2q  and 
8q  that  are  tens  of  kilobases  away  from  the  near¬ 
est  protein-encoding  gene.  Any  significance  to 
the  roles  of  genes  upstream  or  downstream  of 
the  associated  blocks  of  linkage  disequilibrium 
is  currently  speculative. 
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The  interpretation  of  subgroup  analyses  of 
the  immunohistopathologic  phenotypic  varia¬ 
tion  of  risk  alleles  requires  caution  on  account  of 
the  limited  power  of  some  analyses.  Initial  anal¬ 
yses  by  the  Icelandic  group  (110,  11 1),  on  iden¬ 
tification  of  the  predisposition  SNPs  at  16ql2, 
2q35,  and  5pl2,  showed  the  effects  of  these 
alleles  to  be  confined  to  ER-positive  tumors. 
Although  subsequent  analyses  of  the  SNPs  at 
10q26,  16ql2,  8q24,  5qll,  and  llpl5  from 
the  United  Kingdom  genome-wide  association 
data  also  revealed  that  for  all  five  alleles,  the  es¬ 
timates  of  effect  were  stronger  in  ER-positive 
than  ER-negative  disease,  the  difference  was 
only  significant  for  those  at  10q26  and  8q24. 
These  two  SNPs  were  also  independently  asso¬ 
ciated  with  a  lower  grade  of  disease.  Adjusted 
analyses  of  these  five  SNPs  did  not  reveal  sig¬ 
nificant  association  with  lymph  node  status  nor 
survival  (53). 

Because  these  alleles  occur  at  a  high  fre¬ 
quency  in  the  general  population,  their  pop¬ 
ulation  attributable  risks  (etiologic  fractions) 
are  relatively  high  (13  % — 16%  for  the  alleles  of 
stronger  effects)  (67,  110).  However,  this  figure 
represents  only  the  proportion  of  breast  can¬ 
cer  cases  in  which  the  variant  has  played  some 
causal  role  in  development  of  disease.  The  as¬ 
sociated  risks  are  low  and  it  is  estimated  that  the 
five  loci  characterized  by  Easton  and  cowork¬ 
ers  (46)  account  for  a  modest  3 .6%  of  the  excess 
familial  risk  of  breast  cancer  in  European  pop¬ 
ulations. 

INTERACTIONS  BETWEEN 
BREAST  CANCER 
PREDISPOSITION  FACTORS 

Interaction  is  an  important  aspect  of  risk  cal¬ 
culation,  particularly  in  familial  disease  clus¬ 
ters  in  which  multiple  predisposition  factors  are 
likely  active.  The  combined  risk  of  two  fac¬ 
tors  is  strongly  dependent  upon  the  nature  of 
the  interaction  between  them.  Typically  the  de¬ 
fault  model  assumes  that  they  act  independently 
and  multiplicatively  (as  per  terms  in  a  multi¬ 
ple  regression  analysis).  The  true  situation  may 
be  more  complex  and  comprise  a  mixture  of 


multiplicative,  additive,  antagonistic,  synergis¬ 
tic,  and/or  complex  intermediate  interactions. 

The  study  of  co-occurrence  of  genetic  pre¬ 
disposition  factors  and  exposition  of  gene- 
gene  interactions  has  become  more  viable  since 
the  identification  of  convincing  common  pre¬ 
disposition  alleles.  Preliminary  analyses  from 
genome-wide  association  studies  suggest  that 
the  low-penetrance  risk  alleles  act  multiplica¬ 
tively  with  each  other  (46,  110).  However,  as¬ 
sociation  studies  of  the  SNPs  at  10q26  and 
5qll  performed  within  BRCA1  and  BRCA2 
mutation-positive  families  suggest  that  these 
SNPs  confer  additional  risk  in  the  presence  of 
BRCA2  but  not  BRCA1  mutations  (12).  This 
interesting  disparity  may  in  part  reflect  the  as¬ 
sociation  of  the  SNPs  with  ER-positive  tumors 
because  BRCA1  is  typically  associated  with  ER- 
negative  tumors.  In  a  recent  candidate-based 
experiment,  it  was  reported  that  homozygos¬ 
ity  for  a  variant  in  the  5'  untranslated  region  of 
RAD 51  confers  an  increased  risk  of  breast  can¬ 
cer  to  BRCA2  mutation  carriers  [hazard  ratio  = 
3.18  (95%  Cl  =  1.39-7.27)].  The  modify¬ 
ing  effect  was  not  significant  in  BRCA1  mu¬ 
tation  carriers  or  when  only  a  single  copy  of 
the  risk  allele  was  present  (11).  By  contrast, 
CHEK2A  lOOdelC  has  not  been  shown  to  con¬ 
fer  an  elevated  risk  on  the  background  of  mu¬ 
tations  in  either  BRCA1  or  BRCA2  (85).  It  has 
been  proposed  that  this  reflects  the  common 
pathway  of  the  encoded  proteins;  abrogation  of 
CHK2  function  might  have  little  additional  im¬ 
pact  on  a  pathway  already  radically  subverted 
by  a  mutation  in  BRCA1  or  BRCA2.  However, 
there  is  currently  no  biological  proof  of  this  hy¬ 
pothesis.  Further  analyses  are  required  to  estab¬ 
lish  whether  similar  interactions  are  observed 
for  ATM,  BRIP1,  and  PALB2,  which  also  inter¬ 
act  with  BRCA1  and/or  BRCA2  in  DNA  repair 
pathways. 

Clarity  regarding  gene-environment  inter¬ 
actions  is  even  more  limited.  Recognized  en¬ 
vironmental  risk  factors  for  breast  cancer  in 
the  general  population  are  well  established 
and  predominantly  relate  to  estrogen  expo¬ 
sure.  Endogenous  estrogen-related  risk  factors 
include  timing  of  menarche  and  menopause, 
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parity,  age  of  first  live  birth,  and  breast-feeding, 
whereas  exogenous  factors  include  adminis¬ 
tration  of  contraceptives  and  hormone  re¬ 
placement  therapy  (30,  31,  33).  Studies  of 
hormonal/reproductive  factors  in  BRCA1  and 
BRCA2  mutation  carriers  are  challenging,  not 
just  in  the  assembly  of  sufficient  families,  but 
because  of  the  biases  inherent  in  observa¬ 
tion  of  the  behaviors  of  a  group  of  individ¬ 
uals  who  know  themselves  to  be  at  elevated 
risk.  However,  well-powered  studies  of  gene- 
environment  interactions  are  becoming  possi¬ 
ble  through  collaboration  and  are  allowing  the 
effects  of  BRCA1  and  BRCA2  to  be  studied 
independently.  Significant  alteration  of  breast 
cancer  risk  in  BRCA1  and  BRCA2  mutation  car¬ 
riers  has  been  demonstrated  for  oral  contra¬ 
ceptive  usage,  age  at  first  pregnancy,  and  de¬ 
gree  of  parity,  whereas  significant  effects  were 
not  demonstrated  for  age  of  menarche,  age 
of  natural  menopause,  nulliparity,  and  breast 
feeding  (3,  23,  25).  These  observations  war¬ 
rant  further  investigation  and  confirmation  in 
prospective  studies.  Other  large  studies  will  be 
required  to  explore  gene-environment  interac¬ 
tions  for  genetic  predisposition  factors  of  lower 
penetrance. 

IDENTIFICATION  OF  FURTHER 
GENETIC  BREAST  CANCER 
PREDISPOSITION  FACTORS 

There  is  a  clear  discontinuity  in  the  risks  asso¬ 
ciated  with  the  three  categories  of  breast  can¬ 
cer  predisposition  factors  identified  to  date: 
The  high-penetrance  genes  confer  a  risk  that 
is  elevated  >  10-fold,  the  known  intermediate- 
penetrance  genes  2-4-fold,  and  the  low- 
penetrance  alleles  <1. 5-fold.  There  may  be 
some  biological  significance  to  these  strata  or 
they  may  be  an  artifact  of  the  limited  meth¬ 
ods  of  ascertainment.  More  than  70%  of  ge¬ 
netic  predisposition  to  breast  cancer  remains 
unaccounted  for  and  as  further  genetic  predis¬ 
position  factors  are  identified,  new  categories 
may  emerge  and/or  the  apparent  distinctions 
between  extant  classes  may  blur  or  disappear. 
Nevertheless,  risk  and  prevalence  provide  a  use¬ 


ful  framework  for  considering  how  technolog¬ 
ical  and  intellectual  advances  may  allow  exten¬ 
sion  of  the  repertoire  of  breast  cancer  predis¬ 
position  factors. 

High-Penetrance  Genes 

Any  further  high-penetrance  dominant  predis¬ 
position  genes  are  likely  to  be  very  rare  causes 
of  familial  breast  cancer.  Segregation  analyses 
generated  no  evidence  for  further  dominant 
genes  of  a  risk-penetrance  profile  comparable 
to  BRCA1  or  BRCA2  and  this  has  been  corrobo¬ 
rated  by  linkage  studies.  In  a  recently  published 
example,  genome-wide  linkage  analysis  was  un¬ 
dertaken  in  149  nonsyndromic  breast  cancer 
families  negative  for  mutations  in  BRCA1  and 
BRCA2  and  the  number  of  linkage  peaks  de¬ 
tected  under  parametric  (dominant  and  reces¬ 
sive)  and  nonparametric  (allele-sharing)  models 
did  not  differ  significantly  from  that  expected  by 
chance  (109).  However,  linkage  analyses  such 
as  this  have  been  undertaken  in  families  largely 
ascertained  on  the  basis  of  multiple-generation 
breast  cancer  pedigrees.  A  highly  penetrant  re¬ 
cessive  predisposition  gene  would  not  produce 
this  pattern  of  disease  in  families.  Two  inde¬ 
pendent  segregation  analyses  found  evidence 
of  a  recessive  pattern  of  inheritance  (8,  35). 
Thus,  there  may  be  utility  in  genome-wide  re¬ 
cessive  linkage  analyses  in  more  suitable  series, 
in  particular  families  from  understudied  popu¬ 
lation  isolates,  and/or  in  families  with  higher 
levels  of  consanguinity.  Further,  unidentified 
breast  cancer-associated  syndromes  may  also 
exist.  However,  if  these  syndromes  have  not  yet 
come  to  medical/scientific  attention,  they  are 
likely  to  be  very  rare;  linkage  is  likely  to  repre¬ 
sent  the  optimal  method  for  mapping  any  such 
underlying  predisposition  genes. 

Intermediate-Penetrance  Genes 

Further  intermediate-penetrance  breast  cancer 
predisposition  genes  likely  exist,  although  it  is 
difficult  to  predict  how  many  there  may  be 
and  what  proportion  of  the  total  excess  famil¬ 
ial  risk  is  attributable  to  them.  Resequencing 
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is  currently  the  optimal  approach  by  which 
to  identify  genes  of  this  nature.  The  known 
intermediate-penetrance  genes  are  all  involved 
in  DNA  repair  and  function  in  pathways 
with  BRCA1  and/or  BRCA2;  this  common¬ 
ality  may  reflect  the  underlying  biology  of 
this  class  or  may  just  be  an  artifact  of  the 
groups  of  genes  that  have  been  most  in¬ 
tensely  investigated.  Further  genes  involved 
in  DNA  repair  and  other  relevant  path¬ 
ways  represent  plausible  candidates  and  are 
being  investigated  by  us  and  other  groups. 
However,  technological  advances  are  facilitat¬ 
ing  increasingly  high-throughput  mutational 
screening  such  that  genome-wide  resequencing 
is  becoming  viable.  This  will  allow  interroga¬ 
tion  of  regions  of  the  genome  not  previously  in¬ 
vestigated  and  may  reveal  further  intermediate- 
penetrance  genes  that  function  in  pathways  not 
predictable  from  current  paradigms. 

Intermediate-penetrance  predisposition  to 
breast  cancer  may  also  be  an  unrecognized  com¬ 
ponent  of  known  pleomorphic  cancer  predis¬ 
position  syndromes.  Large,  collaborative  epi¬ 
demiological  studies  of  rare  syndromes  can 
optimize  power  and  minimize  bias  and  may 
represent  the  optimal  strategy  by  which  to  es¬ 
tablish  accurate  estimates  of  these  breast  can¬ 
cer  risks.  Epidemiological  studies  of  reces¬ 
sive  syndromes,  particularly  those  associated 
with  childhood  cancer,  may  reveal  elevated  fre¬ 
quency  of  breast  cancers  in  relatives  of  affected 
individuals  and  may  lead  to  the  identification  of 
further  intermediate-penetrance  breast  cancer 
genes,  as  in  the  case  of  ATM. 

Low-Penetrance  Variants 

Common  low-penetrance  variants.  Ex¬ 
tant  genome-wide  association  data  offer 
clear  evidence  that  many  further  common 
low-penetrance  breast  cancer  predisposition 
variants  exist.  Further  risk  alleles  may  be  iden¬ 
tified  from  these  data  by  mining  the  variants 
of  borderline  significance  for  further  signals 
and  through  the  use  of  imputation  techniques. 
However,  comprehensive  study  of  this  (likely) 
extensive  repertoire  of  low-penetrance  alleles 


will  require  further,  larger,  genome-wide 
association  experiments.  The  effect  sizes  of 
subsequent  rounds  of  risk  alleles  are  likely  to 
be  of  progressively  diminishing  magnitude 
and  will  require  commensurate  increases  in 
power  for  detection.  It  is  currently  unclear 
what  proportion  of  common  low-penetrance 
alleles  will  be  detectable  by  the  feasible  studies 
in  the  immediate  future. 

Strongly  associated  tag  SNPs  from  genome¬ 
wide  scans  may  be  utilized  for  genetic  epidemi¬ 
ologic  analyses  and  clinical  risk  estimation  in 
their  own  right.  However,  the  way  in  which 
these  common  variants  contribute  to  cancer  is 
largely  unknown  and  exploration  of  the  biolog¬ 
ical  mechanisms  that  underlie  these  signals  of¬ 
fers  exciting  new  avenues  of  study.  Resequenc¬ 
ing  and  fine-association  mapping  of  the  region 
tagged  by  a  reporter  SNP  can  refine  the  asso¬ 
ciation  to  a  minimum  set  of  SNPs  in  a  fixed 
block  of  linkage  disequilibrium.  The  cancer- 
causing  components  within  these  blocks  are 
cryptic.  Attempts  to  identify  them  may  chal¬ 
lenge  current  paradigms  of  the  relationship  be¬ 
tween  genes  and  disease  and  require  innovative 
methods. 

To  date,  genome-wide  studies  have  been  un¬ 
dertaken  in  simple  series  of  breast  cancer  cases 
of  European  descent.  Novel  risk  alleles  may 
be  identifiable  through  genome-wide  studies 
of  different  ethnic  groups  in  which  the  minor 
allele  frequencies  and  phenotypic  spectrum  of 
disease  differ  from  Europeans.  For  example,  a 
risk  locus  at  6q22  was  identified  in  a  recent 
genome-wide  study  performed  exclusively  in 
Ashkenazi  Jewish  breast  cancer  cases  and  con¬ 
trols;  verification  of  this  association  in  other 
populations  is  awaited  (54).  Broader  insight  into 
genetic  etiology  may  be  gained  through  the  ex¬ 
pansion  of  genome-wide  studies  into  subgroups 
of  breast  cancer  cases.  For  example,  analysis  of 
BRCA- positive  individuals  may  allow  identifi¬ 
cation  of  modifier  alleles.  Studies  focusing  on 
well-characterized  histopathological  subgroups 
of  breast  cancers  may  advance  our  understand¬ 
ing  of  the  genetic  basis  of  disease  heterogene¬ 
ity.  The  use  of  quantitative  intermediate  phe¬ 
notypes,  such  as  mammographic  density,  as  the 
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outcome  for  genome-wide  studies  may  also  rep¬ 
resent  an  alternative  approach. 

Rare  low-penetrance  variants.  The  risk- 
prevalence  profiles  already  identified  suggest 
that  rare  variants  of  low  penetrance  represent 
another  plausible  category  of  breast  cancer  pre¬ 
disposition  factor.  However,  there  are  currently 
no  reliable  methods  by  which  to  quantify  the 
risk  of  breast  cancer  associated  with  individual, 
rare  variants.  Therefore,  although  it  is  possi¬ 
ble  that  some  of  the  rare  variants  detected  dur¬ 
ing  mutational  screening  experiments  are  low- 
penetrance  predisposition  factors,  our  ability  to 
demonstrate  this  is  limited.  Genome-wide  re¬ 
sequencing  will  inevitably  detect  many  further 
rare  variants,  some  of  which  will  likely  be  asso¬ 
ciated  with  small  increases  in  risk  of  breast  can¬ 
cer.  Developing  robust  methods  for  the  iden¬ 
tification  of  the  cancer-associated  rare  variants 
among  the  numerous  innocuous  variants  will  be 
a  major  challenge  for  the  future. 

Optimizing  the  Power  of  Gene 
Identification  Studies 

A  priority  for  all  future  studies  will  likely  be 
the  harnessing  of  sufficient  statistical  power  at 
acceptable  cost.  An  important  tool  for  gain¬ 
ing  power  without  increasing  the  experiment 
size/cost  is  to  assemble  a  case  series  enriched  for 
genetic  predisposition.  The  best-recognized  of 
these  enrichment  parameters,  successfully  used 
in  many  experiments  to  date,  include  family 
history,  bilaterality,  and  early  age  of  onset  of 
disease.  Their  relative  efficacies  have  been  com¬ 
pared  through  modeling  the  sample  size  re¬ 
quired  to  demonstrate  association  with  disease 
of  a  variant  of  specified  frequency  and  effect. 
The  use  of  cases  with  a  single  affected  first- 
degree  relative  affords  a  greater  than  twofold 
reduction  in  the  required  sample  size;  for  cases 
with  two  affected  first-degree  relatives  the  re¬ 
duction  is  more  than  fourfold.  The  sample  size 
reduction  when  using  bilateral  cases  is  also  four¬ 
fold  (and  thus  equivalent  to  using  cases  with  two 
first-degree  relatives).  However,  the  sample  size 
required  for  an  association  study  that  uses  cases 


diagnosed  at  age  35  is  only  40%  less  than  one 
that  uses  cases  diagnosed  at  65  years.  Relative 
efficiencies  are  all  magnified  for  experiments  in 
which  rarer  alleles  are  studied  (6). 

An  alternative  strategy  is  geographical  en¬ 
richment,  namely  to  first  screen  genes  in  pop¬ 
ulation  isolates  in  which  a  higher  prevalence  of 
founder  mutations  might  be  anticipated,  such  as 
the  Ashkenazim,  the  Finnish,  or  the  Icelanders. 
This  has  proved  very  successful  for  the  iden¬ 
tification  and  characterization  of  known  genes. 
However,  as  is  currently  the  case  for  RAD50,  the 
generalizability  of  findings  from  specific  popu¬ 
lations  may  prove  challenging. 

There  has  been  longstanding  expectation 
that  evolution  in  molecular  profiling  may  re¬ 
sult  in  pathological  classifications  that  more 
directly  reflect  the  genetic  etiology  of  tu¬ 
mors.  Although  the  basal  phenotype  clearly  en¬ 
riches  for  cases  that  arise  because  of  BRCA1 
mutations,  it  remains  unclear  whether  novel 
immunohistopathological  profiles  or  other 
phenotypic  surrogates  may  emerge  that  can  dis¬ 
tinguish  further  subsets  of  breast  cancers  that 
occur  because  of  genetic  predisposition. 

CLINICAL  TRANSLATION 
Risk  Estimation  and  Management 

Risk  estimation  is  currently  the  primary  clini¬ 
cal  application  for  genetic  factors  that  predis¬ 
pose  to  breast  cancer.  Ongoing  improvement 
of  clinical  risk  assessment  tools  is  necessary  to 
ensure  that  individuals  at  truly  elevated  risk 
are  identified  so  that  they  might  benefit  from 
advances  in  surveillance  techniques  and  pro¬ 
phylactic  interventions.  Advances  in  the  under¬ 
standing  of  the  polygenic  basis  of  breast  cancer 
may  have  different  effects  on  three  groups:  fam¬ 
ilies  positive  for  mutations  in  BRCA 1  or  BRCA2 , 
other  breast  cancer  families,  and  the  general 
population. 

Risk  estimation  in  BRCA-positive  breast 
cancer  families.  Detection  in  a  family  of  a  mu¬ 
tation  in  BRCA1  or  BRCA2  has  afforded  relative 
clarity  in  risk  estimation.  Unaffected  females, 
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at  high  prior  risk  of  cancer  on  account  of  their 
family  history,  can  undertake  predictive  testing 
to  determine  whether  they  carry  the  high-risk 
mutation.  Risk-reducing  prophylactic  surgery 
and/or  intensive  surveillance  may  be  offered  to 
mutation  carriers  whereas  noncarriers  can  be 
reassured  that  their  risk  is  not  as  high.  How¬ 
ever,  studies  have  found  wide  variation  in  the 
penetrance  of  mutations  in  BRCA1  and  BRCA2 
(see  above)  and  this  has  raised  questions  regard¬ 
ing  the  appropriate  estimates  of  risk  for  clinical 
usage.  Some  of  this  apparent  variation  in  pen¬ 
etrance  between  families  may  be  the  result  of 
differing  doses  of  additional  lower-risk  modi¬ 
fying  variants;  recently  discovered  examples  in¬ 
clude  SNPs  in  RAD51,  FGFR2,  and  at  5q  (11, 
12).  Thus,  one  of  the  early  clinical  applications 
of  low-penetrance  alleles  may  be  to  offer  in¬ 
dividualized  refinement  of  risk  to  BRCA1  and 
BRCA2  mutation  carriers. 

Risk  estimation  in  BRCA-negative  breast 
cancer  families.  The  maj  ority  of  breast  cancer 
families  do  not  harbor  mutations  in  BRCA1  or 
BRCA2.  Clinical  breast  cancer  risk  estimation 
in  these  families  is  currendy  based  empirically 
upon  family  history  of  cancer.  For  unaffected 
individuals  in  breast  cancer  families  negative 
for  BRCA1  and  BRCA2  mutations,  it  would  be 
clinically  useful  to  have  better  discrimination 
of  risk  to  distinguish  high-risk  individuals  in 
whom  radical  surgical  prophylaxis  may  be  jus¬ 
tified  and  individuals  at  lower/population  risk 
who  could  be  spared  some  anxiety  and  inter¬ 
vention.  It  remains  unclear  whether  the  un¬ 
derlying  genetic  architecture  is  such  that  this 
discrimination  could  be  provided  by  genotyp- 
ing  multiple  genetic  predisposition  factors  (pre¬ 
suming  sufficient  numbers  were  identified).  If 
the  clustering  of  breast  cancer  in  a  family  has 
occurred  because  of  multiple  low-penetrance 
genetic  factors,  typing  of  these  factors  would 
result  in  many  possible  risk  categories.  Geno- 
typing  may  place  the  majority  of  unaffected 
members  of  the  family  into  intermediate  risk 
categories  that  differ  little  from  their  risk  based 
on  family  history.  If  family  clustering  is  the 
result  of  fewer  factors  of  greater  penetrance, 


genotyping  these  may  adjust  risk  estimation  suf¬ 
ficiently  to  alter  clinical  management  for  unaf¬ 
fected  individuals. 

Risk  estimation  in  the  general  population. 

By  contrast,  in  the  absence  of  a  family  history 
of  breast  cancer,  genotyping  is  the  only  means 
of  identifying  individuals  at  increased  genetic 
risk.  The  discrimination  of  risk  that  could  be  af¬ 
forded  by  population-level  genotyping  of  com¬ 
mon  low-penetrance  variants  is  influenced  by 
two  factors:  firstly,  the  true  extent  to  which 
breast  cancer  risk  varies  across  the  population 
(which  is  currently  unclear)  and  secondly,  the 
proportion  of  the  risk  variants  that  is  available 
for  genotyping  (8,  9,  94).  Although  the  logisti¬ 
cal,  ethical,  social,  and  economic  considerations 
are  numerous,  in  principle  it  seems  plausible 
that  population-based  genotyping  could  even¬ 
tually  be  used  to  aid  stratification  and  resource 
allocation.  Surveillance,  primary  antiestrogen 
chemoprophylaxis,  prophylactic  surgery,  and 
other  emerging  interventions  may  be  appropri¬ 
ate  for  the  high-risk  upper  tail  of  the  popula¬ 
tion.  The  low-risk  tail  may  require  little  or  no 
additional  intervention. 

Targeted  Therapies 

Understanding  of  the  biological  mechanisms 
underlying  breast  cancer  predisposition  genes 
is  beginning  to  offer  exciting  opportunities 
for  new  therapies.  The  roles  of  BRCA1  and 
BRCA2  in  DNA  repair  and  the  resultant  sen¬ 
sitivity  of  BRCA1-  and  BRCA2 -deficient  cells 
to  agents  that  cross-link  DNA  are  being  ex¬ 
ploited  by  studies  in  which  mutation  carriers 
are  treated  with  platinum-based  drugs.  The  in¬ 
herent  DNA  repair  defect  in  BRCA-deficient 
cells  has  also  provided  a  rationale  for  a  further 
therapeutic  approach.  Poly  (ADP-ribose)  poly¬ 
merase  (PARP)  is  an  enzyme  involved  in  base 
excision  repair.  Inhibition  of  PARP  results  in 
an  increase  in  DNA  lesions  that  are  normally 
repaired  through  homologous  recombination, 
which  requires  BRCA1  and  BRCA2.  In  a  back¬ 
ground  deficient  for  either  BRCA1  or  BRCA2 
protein,  cells  are  profoundly  sensitive  to 
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inhibition  by  PARP,  which  results  in  cell  cycle 
arrest,  chromosome  instability,  and  cell  death. 
Thus  in  BRCA  mutation  carriers,  PASP  in¬ 
hibitors  are  synthetically  lethal  to  tumor  cells 
but  confer  no  demonstrable  toxicity  to  normal 
heterozygous  cells.  Studies  are  also  underway 
to  investigate  whether  PARP  inhibitors  have 
similar  effects  in  individuals  with  basal  tumors 
similar  to  those  occurring  in  BRCA1  mutation 
carriers  and/or  cancers  associated  with  defects 
in  other  DNA  repair  proteins  (50,  84).  Under¬ 
standing  the  molecular  basis  of  the  increased 
cancer  risk  associated  with  low-penetrance  al¬ 
leles  is  in  its  infancy.  However,  this  understand¬ 
ing  may  offer  opportunities  for  novel  therapies. 
For  example,  if  some  risk  alleles  drive  tumori- 
genesis  through  the  upregulation  of  oncogenes, 
such  as  FGFR2  and/or  MYC,  these  genes  may 
represent  potential  new  targets  for  therapeutic 
interventions. 


CONCLUSION 

Although  recent  breakthroughs  have  resulted  in 
the  identification  of  distinct  new  groups  of  ge¬ 
netic  breast  cancer  predisposition  factors,  more 
than  70  %  of  the  genetic  predisposition  to  breast 
cancer  remains  unexplained.  Technologies  are 
emerging  rapidly  that  may  allow  us  to  iden¬ 
tify  many  further  predisposition  factors  within 
these  classes  and  of  other  risk-prevalence  pro¬ 
files.  The  identification  of  novel  predisposition 
factors  offers  exciting  challenges,  but  ongoing 
clarification  and  characterization  of  known  ge¬ 
netic  risk  factors  is  also  important.  A  formidable 
and  ongoing  challenge  is  to  marshal  disparate 
experimental  results  to  capture,  validate,  qual¬ 
ify,  and  organize  into  an  integrated  schema  all 
the  emerging  components  of  risk.  This  is  par¬ 
ticularly  important  if  we  are  to  advance  clinical 
risk  estimation  to  optimally  apportion  surveil¬ 
lance  and  prophylactic  interventions. 


SUMMARY  POINTS 

1 .  Multiple  strategies  have  been  used  to  identify  breast  cancer-predisposing  genetic  factors. 

2.  Linkage  analysis  has  been  successful  in  mapping  high-penetrance  genes  such  as  BRCA1 
and  BRCA2. 

3.  Mutational  screening  of  candidate  genes  has  been  used  to  identify  genes  such  as  CFIEK2 , 
ATM,  BR1P1 ,  and  PALB2,  mutations  in  which  are  rare  and  confer  intermediate  penetrance 
of  breast  cancer. 

4.  Genome-wide  association  studies  have  revealed  several  low-penetrance  alleles. 

5.  More  than  70%  of  breast  cancer  predisposition  remains  unexplained. 

6.  The  outstanding  predisposition  is  likely  to  be  polygenic  and  may  include  multiple  fur¬ 
ther  common  low-penetrance  alleles,  rare  intermediate-penetrance  genes,  and  rare  low- 
penetrance  alleles. 

7.  BRCA1  and  BRCA2  mutation  testing  allows  identification  of  individuals  at  elevated  risk 
of  breast  cancer  who  can  be  offered  risk-reducing  interventions. 

8.  Targeted  therapies  are  being  developed  that  exploit  the  biological  functions  of  BRCA1 
and  BRCA2. 


FUTURE  ISSUES 

1.  Further  genome-wide  association  studies  are  required  to  identify  further  common 
variants. 
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2.  Genome-wide  resequencing  is  likely  to  detect  numerous  novel  rare  variants,  some  of 
which  may  predispose  to  breast  cancer. 

3.  Understanding  how  novel  variants  result  in  breast  cancer  predisposition  may  require 
innovative  strategies. 

4.  Quantification  of  gene-gene  interactions  and  gene-environment  interactions,  which  may 
be  heterogeneous  in  nature,  may  be  used  to  improve  risk  estimation. 

5.  Judicious  clinical  translation  of  genetic  factors  in  addition  to  BRCA1  and  BRCA2  may 
assist  in  risk  estimation,  optimization  of  management,  and  development  of  therapies. 
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